
Contract role
Location: Jersey City, NJ
Onsite: Yes, 5 days per week
Duration: Long-term contract
Visa: ANY
Type: Long-term contract or FTE (both available)
Banking/financial client experience is mandatory. Do not send DevOps profiles; we need SRE profiles.
Job Description
The production support consultant role delivers BAU and production support for AWS infrastructure and batch processing workloads, ensuring high availability, operational stability, and on-time completion of scheduled runs.
Support Requirements
Monitoring & Incident Management: Continuous monitoring of AWS environments and batch schedules. Rapid triage and resolution of incidents and job failures, including execution of reruns/recoveries within approved procedures.
Coordination & Service Restoration: Engage and coordinate with engineering and dependent teams to restore service during complex incidents. Maintain clear, structured communication throughout.
Change & Release Support: Support deployment activities, including pre/post-implementation checks and rollback execution as needed.
Documentation: Create and maintain runbooks, SOPs, troubleshooting guides, and escalation paths to ensure operational continuity across shifts.
SLA Accountability: Ensure critical batch jobs complete within agreed cutoffs and incident response/restoration targets are met.
Actively track and improve incident response and restoration times through rapid diagnosis.
Reduce repeat incidents through rigorous RCA and implementation of preventive actions.
Required Competencies & Experience
AWS Platform Support: Hands-on experience supporting production workloads on AWS, including working knowledge of core services such as EC2, S3, Lambda, CloudWatch, IAM, and VPC networking, and the ability to interpret AWS console/CLI output to troubleshoot infrastructure issues in real time.
Batch Scheduling & Job Management: Solid understanding of enterprise batch scheduling concepts, including job dependencies, sequencing, retry logic, processing windows, cutoff management, and failure handling. Ability to interpret job logs, identify upstream/downstream impacts, and execute controlled reruns.
SQL Proficiency: Ability to write and execute basic SQL queries for operational validation and incident troubleshooting.
Python & automation experience.
To apply for this job, email your details to abhinav.partap@avanciers.com