Site Reliability Engineer


Elite Technical is seeking a Site Reliability Engineer in the Washington DC, Maryland and/or Virginia area for a long term contract position with our customer in Reston VA (Hybrid position, 1x onsite per week in Reston VA) Roles & Responsibilities:

- Communicates Architectural decisions, plans, goals, and strategies, while highlighting short-term trade-offs vs. long-term commitments and costs
- Engage in and improve the end-to-end Lifecycle of services, starting from Inception & design, deployment, and operations.
- Establish automation capabilities leveraging Cloud native solutions, to improve the Developer experience.
- Support activities, including System design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
- Willingness to roll up the sleeves and troubleshoot difficult issues and engage the Customer.
- Willingness to learn new AWS Services and other technologies as required.
- Systems Scalability and sustainability leveraging automation and strive to improve our systems with changes that improve reliability and velocity.
- Experience with Enterprise Cloud transformation and migration efforts.
- Actively participate and help guide customers on using Cloud-native design and architecture patterns.
- Provide Consultation on Technology infrastructure planning and engineering for assigned systems; Assesses the implications of technology strategies on infrastructure capabilities.
- Establish strategies to migrate Legacy applications by conversion to multiple Microservices and hosting on AWS Cloud platform.
- Leverage Cloud-native architecture components including Containers, immutable infrastructure, Microservices, Service Mesh etc., to build highly available and Fault tolerant applications.
- Conduct research on the global technology trends and their applicability to FEPOC products in support of our internal development teams and business initiatives.
- Promotes and ensures Modern application design, applies engineering best practices in the development and operations life cycle and mitigates vulnerabilities.
- Monitors and manages the Stability, Availability, and Performance of enterprise systems and platforms across IT domains.
o (e.g., Systems, Network, Storage, Security) by analyzing systems to identify problems, trends, and opportunities for improvement.
- Automate end to end process to maintain (patches and upgrades) of our AWS Cloud ecosystem.
- Makes data-driven recommendations and decisions and continuously improves the overall efficacy and efficiency of our software delivery capabilities.
- Mentoring peers as well as engaging with others across teams and socializing solutions.

Required Skills


- Minimum of One AWS certification is required.
- Minimum of 10years of IT experience of which at least 5 years must be in AWS Cloud
- Platform engineering and Administration.
- Strong Leadership experience with driving Transformation initiatives
- 3-5 years of experience in a Site Reliability Engineering role
- Experience with SRE principles and transformation
- 3+ years of experience with Containerization (Kubernetes), Cloud technologies (AWS, Azure etc.), DevOps tool chain (Ansible, Jenkins, Artifactory, bitbucket, etc.), and technical patterns (IaC, Automated Provisioning/Release, CI/CD, etc.)
- Solid understanding of Software coding techniques and experience with full spectrum of Software engineering (Build, Integration, Test, Releasing and Deployment) leveraging Python.
- Experience in Developing and/or challenging engineering solutions/practices and collaborating with peers within and outside of immediate team, including customers (Dev, Architects, Engineers)
- Platform Engineering Lead with Hands -on Experience: Building robust Middleware Environments, previous Linux System administration is required.
- Must have strong hands-on knowledge of AWS platform and services but not limited to VPC, Networking, Direct Connect, Subnets, NACLs, Security Groups, EC2, S3, IAM, ELBs, Lambda, CloudWatch, CloudTrail, EKS etc.
- Must Have Hands on current Implementation and Production level experience in AWS Cloud.
- Hands on experience with Automation and Infrastructure Provisioning is a must
- Our goal is to only provision infrastructure with Code, and Policy As Code.
- Must be familiar with Terraform automation, Ansible playbooks, and Python code.
- Experience with AWS Cloud Formation and CDK is required.
- Must have hands on experience in writing Lambda functions preferably in Python (Boto3).
- Must be well versed in writing Linux Bash scripts.
- Hands-on experience with Containerization and Amazon EKS is a big plus.
- A great understanding of various DevOps toolchains, including Git/repo, Crucible, Jenkins etc.
- Solid understanding and experience with a CI/CD tool chain.

Apply Now

Return to Search Results

Have a Question?

Location

Hybrid/Reston, VA

Openings

1

Anticipated Start Date

Monday, April 13, 2026

Job Type

Contract

Anticipated Duration

12 months+

Date Posted

Friday, March 20, 2026

Know someone who would be a good fit? We pay for referrals!

Share this job:



Call 800-ELITE-50
Reference #12415

Elite Technical Services, Inc. participates in the E-Verify program to confirm the employment eligibility of all persons hired. This means that we will provide the Social Security Administration (SSA) and, if necessary, the Department of Homeland Security (DHS), with information from each new employee's Form I-9 to confirm work authorization. Elite Technical Services, Inc. will not use E-Verify to pre-screen job applicants.

Elite Technical Services, Inc. is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law.