We are looking for a skilled Systems Administrator to join our team in Washington, District of Columbia. This role involves managing and optimizing IT infrastructure to ensure seamless operations and security across the organization. The ideal candidate will bring a strong background in cloud platforms, virtualization, and network solutions.<br><br>Responsibilities:<br>• Oversee the administration and maintenance of Microsoft 365, Azure, and other cloud platforms to ensure optimal performance.<br>• Configure and manage virtualization technologies such as VMware and Hyper-V to support organizational needs.<br>• Implement and maintain endpoint security measures to safeguard systems and data.<br>• Collaborate on the design and deployment of new network solutions to enhance operational efficiency.<br>• Install, configure, and manage network hardware, including servers, routers, switches, firewalls, and wireless infrastructure.
<p>Senior Cloud Engineer – Observability & Performance Engineering</p><p>Location: Washington, DC 20549</p><p>Work Arrangement: Fully Onsite</p><p>Clearance Requirement: Ability to obtain and maintain Public Trust</p><p><br></p><p>Position Overview</p><p>We are seeking a highly experienced Cloud Engineer (Observability) to lead the engineering, optimization, and operational maturity of enterprise observability platforms across hybrid cloud and containerized environments.</p><p>This role is ideal for a hands-on engineer with deep expertise in Datadog, distributed tracing, APM, cloud monitoring, performance engineering, and site reliability practices. The successful candidate will partner with infrastructure, cloud, platform, and application teams to improve operational visibility, reduce alert fatigue, accelerate incident resolution, and drive data-informed operational decisions.</p><p><br></p><p>Key Responsibilities</p><p>Observability Platform Engineering</p><ul><li>Engineer and operate enterprise observability solutions including:</li><li>Metrics</li><li>Logs</li><li>Distributed tracing</li><li>APM</li><li>Real User Monitoring (RUM)</li><li>Synthetic monitoring</li><li>Network monitoring</li><li>Build and optimize dashboards, alerts, SLOs, and SLIs</li><li>Implement OpenTelemetry and language-specific instrumentation</li><li>Integrate observability tooling with ServiceNow, CI/CD pipelines, and incident management workflows</li><li>Establish and maintain telemetry tagging standards and governance</li></ul><p>Cloud & Container Monitoring</p><ul><li>Design monitoring solutions for Azure and AWS workloads</li><li>Implement observability for:</li><li>Serverless services</li><li>Managed databases</li><li>Networking</li><li>Identity services</li><li>Cloud-native platforms</li><li>Support Kubernetes and OpenShift monitoring including clusters, nodes, workloads, and service mesh environments</li><li>Develop reusable observability modules using Infrastructure-as-Code</li></ul><p>Performance Engineering & Reliability</p><ul><li>Lead investigation and remediation of performance, latency, reliability, and capacity issues</li><li>Utilize APM, profiling, distributed tracing, and database analytics to identify bottlenecks</li><li>Define trace-based alerting and deployment correlation strategies</li><li>Support major incident response activities and root cause analysis efforts</li></ul><p>Capacity Planning & Operational Excellence</p><ul><li>Analyze telemetry and capacity trends to identify risks and opportunities</li><li>Develop reporting and dashboards for leadership and engineering teams</li><li>Improve alert quality, monitoring coverage, and operational maturity</li><li>Support enterprise SLA, KPI, and availability objectives</li></ul>
<p>We are looking for a Cloud Engineer to help create resilient, modern cloud solutions that support large-scale, event-driven workloads in Northern, Virginia. This position combines technical architecture, hands-on engineering, and close collaboration with stakeholders to deliver secure, high-performing systems. The role is ideal for someone who can balance strategic thinking with practical execution while guiding teams and strengthening client partnerships.</p><p><br></p><p>Responsibilities:</p><p>• Create and refine cloud-native, serverless solutions designed to support high-throughput data processing and event-based applications.</p><p>• Work directly with stakeholders to understand operational goals, identify limitations, and recommend effective technical approaches.</p><p>• Provide technical guidance to decision-makers by evaluating options, outlining risks, and helping resolve complex engineering issues.</p><p>• Build trusted relationships with clients and contribute to team growth through mentorship, collaboration, and technical leadership.</p><p>• Remain actively involved in delivery by participating in architecture reviews, overseeing implementation quality, and developing prototypes when needed.</p><p>• Investigate production issues and optimize system reliability, scalability, and overall performance across cloud environments.</p>
<p>Network Storage Engineer – SAN / Cisco MDS / Nexus</p><p>Location: Washington, DC 20549</p><p>Work Arrangement: Full Onsite</p><p>Clearance Requirement: Ability to obtain and maintain Public Trust</p><p><br></p><p>Position Overview</p><p>We are seeking a highly experienced Network Storage Engineer to support enterprise SAN infrastructure and mission-critical storage network operations within a complex federal environment in Washington, DC.</p><p><br></p><p>This role is responsible for maintaining secure, reliable, and high-performing SAN connectivity across enterprise storage, virtualization, replication, and backup environments. The ideal candidate brings deep expertise in Cisco MDS, Cisco Nexus, SAN administration, and data center operations, along with strong troubleshooting and operational support experience in highly available enterprise environments.</p><p>eKey Responsibilities</p><p>SAN Fabric Administration & Engineering</p><ul><li>Administer and maintain enterprise SAN fabric infrastructure using Cisco MDS and Cisco Nexus platforms</li><li>Configure, deploy, and support SAN connectivity across enterprise storage systems</li><li>Execute SAN-related moves, adds, and changes in accordance with approved change control processes</li><li>Maintain SAN standards, configurations, topology diagrams, and operational documentation</li></ul><p>Connectivity & Performance Operations</p><ul><li>Manage SAN connectivity supporting:</li><li>Enterprise storage</li><li>Virtualization platforms</li><li>Replication services</li><li>Backup and recovery operations</li><li>Monitor SAN network health, availability, and performance</li><li>Perform capacity planning and performance tuning activities</li><li>Produce operational and performance reporting as required</li></ul><p>Incident Management & Troubleshooting</p><ul><li>Troubleshoot SAN connectivity and performance issues across enterprise environments</li><li>Coordinate incident response and escalations with storage, server, and vendor teams</li><li>Conduct root cause analysis and implement corrective actions</li><li>Participate in operational surge support and on-call response activities</li></ul><p>Compliance & Operational Readiness</p><ul><li>Ensure SAN operations align with enterprise security and compliance standards</li><li>Support disaster recovery and continuity objectives</li><li>Maintain audit-ready SOPs, implementation records, and infrastructure documentation</li><li>Collaborate with enterprise infrastructure and operations teams for integrated support</li></ul>