Software Engineer – SRE (Java Support)
navneetkaur | Updated: March 14, 2026
About Trantor:
Trantor is a technology services company focused on outsourced product development and digital re-engineering. Leveraging our CaptiveCoE™ engagement model, we operate as a seamless extension of our clients' teams to provide rapid scalability with predictable budgets. Founded in 2012, Trantor has worked with customers across the Tech, FinTech, Media, and Cybersecurity industries. We have centers in the US, India, Canada, and Costa Rica. We are consistently rated the #1 employer in the region, with the ability to attract and retain technical talent. Our commitment to excellence and impactful results has translated into long-term relationships and value for our clients and solution partners.
Job Overview
We are looking for a Software Engineer – Site Reliability Engineering (SRE) with strong Java production support experience to ensure the stability, reliability, and performance of enterprise applications. The role involves monitoring production systems, troubleshooting incidents, performing root cause analysis, and supporting Java-based applications running on AWS infrastructure.
Key Responsibilities
- Provide L2/L3 production support for Java-based applications.
- Monitor application health, system performance, and service availability.
- Investigate and resolve production incidents, bugs, and performance issues.
- Perform root cause analysis (RCA) and implement preventive solutions.
- Work closely with development teams to resolve application defects.
- Analyze application logs and troubleshoot issues across distributed systems.
- Support deployment activities and production releases.
- Ensure system reliability and availability in line with SLA requirements.
- Participate in on-call support rotations and incident management.
- Maintain documentation for operational procedures and known issues.
Must-Have Skills
- Strong experience in Java production support/application support.
- Experience troubleshooting Java-based microservices applications.
- Hands-on experience with Spring Boot applications.
- Experience with the AWS cloud environment.
- Knowledge of SQL and database troubleshooting.
- Experience with log analysis tools (Splunk, ELK, or similar).
- Experience in incident management and root cause analysis.
- Understanding of Linux/Unix environments.
Good-to-Have Skills
- Experience with Docker and Kubernetes.
- Knowledge of CI/CD pipelines.
- Experience with monitoring tools (CloudWatch, Prometheus, Grafana).
- Familiarity with message queues such as Kafka or RabbitMQ.
- Understanding of site reliability and observability practices.


