Full Job Description
Why Kyndryl
We design, build, manage and modernize the mission-critical technology systems that the world depends on every day. Our people are at the center, discovering, co-creating, and strengthening. We push ourselves and each other to seek better, to go further, and we carry this energy to our customers. In October 2020, IBM announced it’s intention to separate the IT Infrastructure Services unit of its Global Technology Services division into a new, separate public company, creating two industry-leading companies – IBM and Kyndryl. The spin-off is expected to be completed by the end of 2021. To find out more about Kyndryl, including information relating to privacy, please visit Kyndryl.com.
Please be aware Kyndryl will continue to use some IBM systems for a certain period after spin-off. This means when you sign up to either the IBM or Kyndryl candidate portal, you will have the benefit of being able to see and apply for IBM and Kyndryl jobs and to access information about IBM and Kyndryl jobs you have applied to, for a limited period from either candidate portal. If you have already signed up as a candidate on IBM’s portal, please continue to use this account to access IBM and Kyndryl jobs.
Your Role and Responsibilities
Your Role and Responsibilities :
As Service Availability Manager/Site Reliability Engineer, you will focus on the Availability Management for GTS Infrastructure Services Clients and work constantly towards enhancing the Reliability of the estate.
You will optimize the availability of IT infrastructure, systems and services constantly working on improving the reliability of the environment to meet the commitments IBM has made to its clients related to availability target levels in a cost-effective manner.
You will use technical and client environment knowledge to assure services and components are designed and delivered to meet their availability targets. They provide a holistic view of the client’s environment and make recommendations to improve overall service availability. You are required to specialize in reliability with the right mix of knowledge and skills in software and systems, responsible to analyse business needs, problem determination, advise & design, build, test, deploy, changes and maintenance of a well-engineered information system.
Responsibilities
Responsible to develop and maintain Availability Plans which prioritizes and plans IT availability.
Monitor IT availability levels by comparing actual levels against targets and addressing shortfalls.
Good understanding of ITIL and Service Management functions and ability to drive/own SLAs and its management.
Focus on automating the manual IAM tasks using standard tools
System Thinking end-to-end – Broad understanding of enterprise architectures and complex (backend) systems (understand more than the component itself)
Understanding of systems from a reliability perspective. Ability to root cause sources of instability in a high-traffic, distributed system.
Passion for resolving reliability issues and identify strategies to mitigate going forward.
Understanding and practical working experience of operating system / hypervisor internals are familiar with the TCP/IP stack, network routing and load balancing. Experience with configuration and troubleshooting
Monitoring and Event management for complex systems.
Perform multiple roles related to the delivery of Identity and access management automation tools services to IBM, and Client
Responsibilities include Requirement gathering, Crafting the solution, Development, Testing, Implementation / Deployment of the solution, installation of IAM tools, integration with services
IAM tools customization, generation of reports from IAM tools, workflow and policies, authentication / authorization and SSO federation integration, applying security policies, compliance process and security regulations, privileged identity (shared id) management, Maintenance of the tools, and support.
Handle single to multiple accounts as required
Provide domain expertise in specific areas such as logical IAM tools like ISIM, ISAM, IGI, PIM, UAT, MFIM, SiteMinder, Control minder, Centrify, Cyberark and prepare audit readiness and compliance
Required Technical and Professional Expertise
10+ Years of experience in Service Availability Management
Ability to articulate standard methodologies for during implementation
Proven experience in risk-based systems running on Several Server platforms along with OS clustering, writing shell scripting, partitioning and virtualization
Demonstrated experience in handling day-to-day Server and operating system installation, migration and break-fix support
Willingness to work in nights shifts or support 24 x 7 Coverage as per the Business needs
Preferred Technical and Professional Experience
You love collaborative environments that use agile methodologies to encourage creative design thinking and find innovative ways to develop with cutting edge technologies
Ambitious individual who can work under their own direction towards agreed targets/goals and with creative approach to work
Solve difficult engineering problems (and don’t mind getting your hands dirty)
Passionate about automation and innovations that improve productivity by reducing toil
Data-driven / scientific approach to fact-finding and prioritization.
Fair understanding of mathematical and statistical models to assess trends.
Organizational knowledge / Strong communication (verbally and written) / collaboration / negotiation skill, working in a diverse team cross business unit
Intuitive individual with an ability to manage change and proven time management
Proven interpersonal skills while contributing to team effort by accomplishing related results as needed
Up-to-date technical knowledge by attending educational workshops, reviewing publications
Good Software engineering skills (with experience in Python, Go and/or Java(script), Node.js, Angular, NoSQL) is an advantage
Required Education
Bachelor’s Degree
Preferred Education
Master’s Degree
Country/Region
India
State / Province
TAMIL NADU
City / Township / Village
Chennai
Being You @ Kyndryl
Kyndryl is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, pregnancy, disability, age, veteran status, or other characteristics. Kyndryl is also committed to compliance with all fair employment practices regarding citizenship and immigration status.
Other things to know
When applying to jobs of your interest, we recommend that you do so for those that match your experience and expertise. Our recruiters advise that you apply to not more than 3 roles in a year for the best candidate experience.
For additional information about location requirements, please discuss with the recruiter following submission of your application.
Primary job category
Site Reliability Engineer
Role ( Job Role )
Site Reliability Engineering Professional
Employment Type
Full-Time
Contract type
Regular
Position Type
Professional
Travel Required
No Travel
Company
(Y030) Kyndryl Solutions Private Limited
Is this role a commissionable/sales incentive based position?
No