Today
Secret
Unspecified
Unspecified
Engineering - Mechanical
Heredia, Costa Rica (On-Site/Office)
Introduction
About IBM
IBM is a global technology and innovation company. It is the largest technology and consulting employer in the world, with presence in 170 countries. The diversity and breadth of the entire IBM portfolio of research, consulting, solutions, services, systems and software, unusually distinguishes IBM from other companies in the industry.
Over the past 100 years, a lot has changed at IBM, in this new era of Cognitive Business, IBM is helping to reshape industries as diverse as healthcare, retail, banking, travel, manufacturing, and many more, by bringing together our expertise in Cloud, Analytics, Security, Mobile, and the Internet of Things. We like to say, "be essential." We are changing how we craft. How we collaborate. How we analyze. How we engage.
Join the next generation of innovators, inventors and entrepreneurs who are crafting the very way the world works. We want the brightest minds doing work that encourages, in an environment where growth is supported. IBMers get to discover their potential, so they're inspired to build breakthroughs that help our clients succeed. We're building teams with dynamic strengths with people who want their ideas to matter. Join us - you'll be proud to call yourself an IBMer.
Our Culture:
IBM is committed to crafting a diverse environment and is proud to be an equal opportunity employer. You will receive consideration for employment without regard to your race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.
Business Unit Introduction
IBM Cloud is a one-stop shop which provides all the cloud solutions & cloud tools the industries need. IBM Cloud portfolio includes infrastructure as a service (IaaS), software as a service (SaaS) and platform as a service (PaaS) offered through public, private and hybrid cloud delivery models, in addition to the components that make up those clouds.
IBM Cloud ensures seamless integration into public and private cloud environments. The infrastructure is secure, scalable, and flexible, providing customized enterprise solutions that have made IBM Cloud the Hybrid Cloud Market leader with our market leading IAAS and PAAS Platforms. The IBM Cloud platform is the public cloud offering from IBM providing services to global enterprises. IBM Cloud is the Cloud for Smarter Business, built on Open Technology with Developer Tools and supports solutions by Industry. We run the services and workloads from Watson, Blockchain, Services, Security, and IoT.
Ready to help drive IBM's success in the Cloud market? This is your chance to research and learn new Cloud related technology products and services, as well as to design and implement quick Cloud based prototypes while advancing your career in leading edge technology.
Who you are:
As a site reliability engineer and operations manager (SRE) in the IBM Cloud Infrastructure organization, you will be responsible for managing and leading a team of SRE engineers. Responsibilities include ensuring the reliability, scalability, and operational efficiency of IBM Cloud's storage services. You will do the hiring, training, and mentoring team members, assigning tasks, setting goals, and conducting performance evaluations. You will work closely with development teams, SRE peers and engineering managers to automate infrastructure management, optimize system performance, and enhance monitoring capabilities. This role involves writing code, building automation, troubleshooting production issues, and improving overall service reliability. Overall, an SRE Manager plays a crucial role in aligning engineering and operations to achieve reliable software systems. Combine technical expertise with leadership and management skills to drive continuous improvement and ensure high-quality service delivery.
Your role and responsibilities
Key Responsibilities:
Leadership
Reliability & Scalability
Monitoring & Observability
Automation & Infrastructure as Code (IaC)
Incident Management
Security & Compliance
Cross-Team Collaboration & DevOps Culture
Required education
Bachelor's Degree
Preferred education
Bachelor's Degree
Required technical and professional expertise
Required Skills & Experience:
Technical Skills
Experience
Required Education and Experience
• Bachelor's degree in computer science engineering/information technology.
• 5+ years.
ABOUT BUSINESS UNIT
IBM Systems helps IT leaders think differently about their infrastructure. IBM servers and storage are no longer inanimate - they can understand, reason, and learn so our clients can innovate while avoiding IT issues. Our systems power the world's most important industries and our clients are the architects of the future. Join us to help build our leading-edge technology portfolio designed for cognitive business and optimized for cloud computing.
YOUR LIFE @ IBM
In a world where technology never stands still, we understand that, dedication to our clients success, innovation that matters, and trust and personal responsibility in all our relationships, lives in what we do as IBMers as we strive to be the catalyst that makes the world work better.
Being an IBMer means you'll be able to learn and develop yourself and your career, you'll be encouraged to be courageous and experiment everyday, all whilst having continuous trust and support in an environment where everyone can thrive whatever their personal or professional background.
Our IBMers are growth minded, always staying curious, open to feedback and learning new information and skills to constantly transform themselves and our company. They are trusted to provide on-going feedback to help other IBMers grow, as well as collaborate with colleagues keeping in mind a team focused approach to include different perspectives to drive exceptional outcomes for our customers. The courage our IBMers have to make critical decisions everyday is essential to IBM becoming the catalyst for progress, always embracing challenges with resources they have to hand, a can-do attitude and always striving for an outcome focused approach within everything that they do.
Are you ready to be an IBMer?
ABOUT IBM
IBM's greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.
Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world, we're also one of the biggest technology and consulting employers, with many of the Fortune 50 companies relying on the IBM Cloud to run their business.
At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it's time for you to join us on our journey to being a responsible technology innovator and a force for good in the world.
OTHER RELEVANT JOB DETAILS
For additional information about location requirements, please discuss with the recruiter following submission of your application.
About IBM
IBM is a global technology and innovation company. It is the largest technology and consulting employer in the world, with presence in 170 countries. The diversity and breadth of the entire IBM portfolio of research, consulting, solutions, services, systems and software, unusually distinguishes IBM from other companies in the industry.
Over the past 100 years, a lot has changed at IBM, in this new era of Cognitive Business, IBM is helping to reshape industries as diverse as healthcare, retail, banking, travel, manufacturing, and many more, by bringing together our expertise in Cloud, Analytics, Security, Mobile, and the Internet of Things. We like to say, "be essential." We are changing how we craft. How we collaborate. How we analyze. How we engage.
Join the next generation of innovators, inventors and entrepreneurs who are crafting the very way the world works. We want the brightest minds doing work that encourages, in an environment where growth is supported. IBMers get to discover their potential, so they're inspired to build breakthroughs that help our clients succeed. We're building teams with dynamic strengths with people who want their ideas to matter. Join us - you'll be proud to call yourself an IBMer.
Our Culture:
IBM is committed to crafting a diverse environment and is proud to be an equal opportunity employer. You will receive consideration for employment without regard to your race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.
Business Unit Introduction
IBM Cloud is a one-stop shop which provides all the cloud solutions & cloud tools the industries need. IBM Cloud portfolio includes infrastructure as a service (IaaS), software as a service (SaaS) and platform as a service (PaaS) offered through public, private and hybrid cloud delivery models, in addition to the components that make up those clouds.
IBM Cloud ensures seamless integration into public and private cloud environments. The infrastructure is secure, scalable, and flexible, providing customized enterprise solutions that have made IBM Cloud the Hybrid Cloud Market leader with our market leading IAAS and PAAS Platforms. The IBM Cloud platform is the public cloud offering from IBM providing services to global enterprises. IBM Cloud is the Cloud for Smarter Business, built on Open Technology with Developer Tools and supports solutions by Industry. We run the services and workloads from Watson, Blockchain, Services, Security, and IoT.
Ready to help drive IBM's success in the Cloud market? This is your chance to research and learn new Cloud related technology products and services, as well as to design and implement quick Cloud based prototypes while advancing your career in leading edge technology.
Who you are:
As a site reliability engineer and operations manager (SRE) in the IBM Cloud Infrastructure organization, you will be responsible for managing and leading a team of SRE engineers. Responsibilities include ensuring the reliability, scalability, and operational efficiency of IBM Cloud's storage services. You will do the hiring, training, and mentoring team members, assigning tasks, setting goals, and conducting performance evaluations. You will work closely with development teams, SRE peers and engineering managers to automate infrastructure management, optimize system performance, and enhance monitoring capabilities. This role involves writing code, building automation, troubleshooting production issues, and improving overall service reliability. Overall, an SRE Manager plays a crucial role in aligning engineering and operations to achieve reliable software systems. Combine technical expertise with leadership and management skills to drive continuous improvement and ensure high-quality service delivery.
Your role and responsibilities
Key Responsibilities:
Leadership
- Provide strategic guidance to engineering teams on architectural decisions and directions.
- Empower teams to achieve technical excellence, with a focus on reliability, scalability, and simplicity.
- Foster collaboration across engineering, product, and other cross-functional teams to deliver optimal solutions.
Reliability & Scalability
- Design and build highly available, distributed services, focusing on scalability, security, and performance.
- Implement Kubernetes and OpenShift-based solutions for managing containerized applications at scale.
- Develop auto-scaling, load balancing, and failover strategies to ensure seamless service availability and resilience.
Monitoring & Observability
- Design and implement monitoring solutions to gain insights into system health, performance, and reliability.
- Build and maintain intuitive dashboards for real-time visibility into critical system metrics.
- Set up proactive alerting mechanisms to detect and resolve issues before they impact end users.
Automation & Infrastructure as Code (IaC)
- Develop automation scripts using tools such as Terraform and Ansible to streamline infrastructure management.
- Automate operational tasks to improve system reliability and reduce manual intervention.
- Implement and optimize CI/CD pipelines for deploying applications in Kubernetes and OpenShift environments.
Incident Management
- Lead incident response, performing root cause analysis (RCA) and implementing long-term fixes to improve system resilience.
- Build observability solutions with monitoring, logging, and alerting using tools like Prometheus, Grafana, Splunk, and IBM Cloud Monitoring.
- Define and monitor Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs) to ensure service reliability.
Security & Compliance
- Ensure compliance with security best practices and regulatory requirements across all infrastructure components.
- Implement secret management, encryption, and access control for sensitive systems and data.
- Lead security audits, vulnerability assessments, and compliance automation efforts.
Cross-Team Collaboration & DevOps Culture
- Collaborate closely with development, operations, and security teams to design and implement resilient architectures.
- Promote DevOps/SRE best practices, such as blameless postmortems, incident retrospectives, and operational readiness reviews.
- Mentor junior engineers and contribute to knowledge sharing across teams to build a strong DevOps culture.
Required education
Bachelor's Degree
Preferred education
Bachelor's Degree
Required technical and professional expertise
Required Skills & Experience:
Technical Skills
- Programming Languages: Go, Python, Bash, and other scripting languages for automation and tool development.
- Cloud & Infrastructure: Expertise in Kubernetes, OpenShift, Docker, IBM Cloud and other cloud platforms.
- Storage Technologies: Cloud storage solutions.
- CI/CD & Automation: Proficiency with GitHub Actions, Jenkins, Ansible continuous integration and delivery.
- Monitoring & Logging: Hands-on experience with Prometheus, Grafana or other similar tools for system observability.
Experience
- Experience in DevOps, SRE, or related roles, with a strong focus on cloud-based infrastructure and automation.
- Cloud Infrastructure: Advanced understanding of cloud operations, including designing scalable and resilient architectures.
- Linux Expertise: Proficiency in Linux shell scripting and deep understanding of Linux internals.
- Container Management: Expertise in Docker, Kubernetes, and OpenShift, with strong skills in OpenShift 4.x administration, custom operators, Helm charts, and multicluster management.
- Security & Compliance: Significant experience in automating security practices, including secret management, IAM roles and policies, and implementing encryption strategies.
- Disaster Recovery & BCDR: Expertise in designing disaster recovery strategies, business continuity planning, and conducting BCDR simulations with multi-region failover, backup automation, and data redundancy.
- CI/CD Pipeline Management: Leadership experience in implementing and managing CI/CD pipelines using Jenkins, GitHub Actions, or GitLab CI, and automating application deployment with Kubernetes and OpenShift.
- Logging & Monitoring Expertise: Deep experience in implementing and managing centralized logging systems and monitoring solutions to ensure system observability, alerting, and performance tuning.
Required Education and Experience
• Bachelor's degree in computer science engineering/information technology.
• 5+ years.
ABOUT BUSINESS UNIT
IBM Systems helps IT leaders think differently about their infrastructure. IBM servers and storage are no longer inanimate - they can understand, reason, and learn so our clients can innovate while avoiding IT issues. Our systems power the world's most important industries and our clients are the architects of the future. Join us to help build our leading-edge technology portfolio designed for cognitive business and optimized for cloud computing.
YOUR LIFE @ IBM
In a world where technology never stands still, we understand that, dedication to our clients success, innovation that matters, and trust and personal responsibility in all our relationships, lives in what we do as IBMers as we strive to be the catalyst that makes the world work better.
Being an IBMer means you'll be able to learn and develop yourself and your career, you'll be encouraged to be courageous and experiment everyday, all whilst having continuous trust and support in an environment where everyone can thrive whatever their personal or professional background.
Our IBMers are growth minded, always staying curious, open to feedback and learning new information and skills to constantly transform themselves and our company. They are trusted to provide on-going feedback to help other IBMers grow, as well as collaborate with colleagues keeping in mind a team focused approach to include different perspectives to drive exceptional outcomes for our customers. The courage our IBMers have to make critical decisions everyday is essential to IBM becoming the catalyst for progress, always embracing challenges with resources they have to hand, a can-do attitude and always striving for an outcome focused approach within everything that they do.
Are you ready to be an IBMer?
ABOUT IBM
IBM's greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.
Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world, we're also one of the biggest technology and consulting employers, with many of the Fortune 50 companies relying on the IBM Cloud to run their business.
At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it's time for you to join us on our journey to being a responsible technology innovator and a force for good in the world.
OTHER RELEVANT JOB DETAILS
For additional information about location requirements, please discuss with the recruiter following submission of your application.
group id: 90615168
There is no other company like IBM and there is no business professional like the IBMer. We are experts in nearly every technical scientific and business field. We are citizens of, and apply our expertise in, more than 170 countries. Yet we are united by a single purpose: to be essential. IBMers change how the world works. Join us at IBM Consulting and embrace your passion to make a difference.