Â
Job Title : Site Reliability Engineer
Location:Â Â Bartlesville, OK or Houston, TX-Â 5 day a week on-site position
Â
Rate: open to market
Â
We need 2 genuine professional references with LinkedIn profile and updated resume
Â
The candidate must have senior level experience deploying OpenShift on premises and supporting applications in Kubernetes. The ideal candidate will have experience in both on-prem OpenShift and Azure Kubernetes container platforms.
Â
The successful candidate will possess strong infrastructure and developer background as well as interpersonal skills needed to communicate design requirements and objectives while providing thought leadership to peers and leadership.
Â
Candidates should be self-motivated and collaborative IT professionals with a strong background in software development, systems administration and IT automation.
Â
Responsibilities:
Â
* Maintaining survivability and reliability of IT/OT critical resources.
* Write and build CI/CD pipelines and build/release processes for IT/OT workflow applications.
* Provide mentoring to the IT/OT Devops team in the best practices associated with CI/CD deployments using ADO, and GIT.
* Perform periodic load and scalability testing to establish baselines, drift, and capacity planning.
* Conduct weekly operational state reviews covering performance trends, anomalies, errors, and other availability events with SREs, product owners, and development teams.
* Participate in quarterly business and operational reviews aligning on roadmaps, development velocity, efficiency, growth trends, patching, etc.
* Plan and execute periodic Disaster Recovery exercises including both tabletop and simulated failures (fault injection).
Â
Required Qualifications
Â
* Candidates must have a bachelor’s degree and 7 years of IT experience.
* Senior level experience with OpenShift and Kubernetes.
* Familiarity with continuous integration/deployment processes and tools such as IDEs (Eclipse), Source Code management. (GIT/Stash), ADO Pipelines, Maven, Nexus artifacts, etc.
* Strong understanding of SRE practices: incident response, change/release management, capacity planning, infrastructure automation, elastic environments, chaos engineering and blameless postmortems.
* Expertise in application performance monitoring, observability, and proactive alert correlation, including monitoring containers and failure-based alerting.
* Scripting experience such as Python and Bash
* Experienced in deploying applications in OpenShift in both public and private cloud.
* Excellent written and oral communications skills
* Demonstrated ability to communicate to nontechnical audience on technical issues.
* Demonstrated ability to communicate on a technical level to a technical audience.
* Strong interpersonal skills, adaptable and able to learn quickly.
* Requires limited supervision and have excellent time management skills.
* Self-motivated and self-starter.
* Ability to work and interact with others in a structured/team environment.
Â
Technology Stack
Â
Experience with at least one technology in each of the tech stack categories below:
Â
* Monitoring and Logging Tools(s): AppDynamics, Splunk, ELK Stack, DataDog, Prometheus, AWS CloudWatch/X-Ray, Grafana
* Programming: C# .NET, PowerShell, Python, YAML
* Containers: Docker, Helm Chart
* Code Repos: Azure Repos, GitHub, GitLab
* Infrastructure as code: Terraform, Ansible
* Automation Tools: Ansible,Jenkins, Chef, Puppet
* Agile: JIRA, SAFe
Â
Desired Qualifications
Â
* Experience in cloud/virtual technologies and management – OpenShift, VMware, AWS, Azure, etc.
*Familiarity with security best practices for containerized applications.
* Knowledge of DevOps practices and tools.
* Knowledge, skills and abilities to automate the creation of Platform as a Services (PaaS) infrastructure using industry standard tools such as Ansible and Chef.
* Familiarity with Industrial Control System (ICS) security architecture – Purdue model.
Â
Regards
Felina Niroshlinfelina@amtexenterprises.us
Amtex Enterprises, Inc 3080 Olcott Street, Unit B245, Santa Clara, CA 95054
—
——————– US STAFFING ESSENTIALS ————————————–
For daily US JOBS / Updated Hotlist / Post hotlist / Vendor Lists from the trusted sourcesÂ
For Everything in US Staffing Just Search on google ” C2C HOTLIST ” for daily 5000+ US JOBS and Updated 10000+ Hotlists.
Have you Checked this No.1 US Staffing Whatsapp Channel for Daily C2C Jobs/ Hotlists and Top US staffing Telegram Channel of 50k American vendors