<p><strong>Key Responsibilities</strong></p><ul><li>Design and implement <strong>secure, scalable, and highly available Azure cloud architectures</strong>.</li><li>Build and manage Azure infrastructure using <strong>Terraform, ARM templates, and Azure CLI</strong>.</li><li>Provision and support Azure services including <strong>compute, networking, storage, and PaaS</strong> offerings.</li><li>Partner with network and security teams to implement <strong>IAM, network security, and data protection controls</strong>.</li><li>Implement monitoring, logging, and alerting using <strong>Azure Monitor, Log Analytics, and Application Insights</strong>.</li><li>Troubleshoot performance, availability, and reliability issues across Azure environments.</li><li>Automate deployments and operational workflows, including integrations with <strong>ServiceNow APIs</strong>.</li><li>Support CI/CD and infrastructure automation initiatives to improve deployment consistency and efficiency.</li></ul><p><br></p>
<p><u>Network Engineer (NOC )</u></p><p><br></p><p>We are seeking a Network Engineer with strong Network Operations Center (NOC) experience to support and maintain network environments across multiple client infrastructures. This role focuses on monitoring network health, responding to alerts, troubleshooting outages, and ensuring stable and secure connectivity across distributed networks.</p><p>The ideal candidate has experience working in a managed services or multi-client environment, supporting network infrastructure through monitoring platforms and escalation workflows. This role requires someone who is comfortable responding to alerts, diagnosing network incidents, and maintaining uptime across complex environments.</p><p><br></p><p><u>Key Responsibilities</u></p><p><br></p><ul><li>Monitor network infrastructure using NOC monitoring platforms to ensure network availability and performance.</li><li>Respond to alerts and incidents related to network connectivity, access points, routing issues, and firewall events.</li><li>Troubleshoot outages and escalate or resolve issues impacting network services.</li><li>Participate in operational incident response and follow structured troubleshooting methodologies during service disruptions.</li><li>Deploy, configure, and maintain network infrastructure including routers, switches, firewalls, and wireless access points.</li><li>Support network environments across multiple client infrastructures and distributed locations.</li><li>Assist with network upgrades, configuration updates, and infrastructure improvements.</li><li>Assist with implementation and maintenance of network security practices and firewall configurations.</li><li>Support multiple firewall platforms and maintain network segmentation and access control policies.</li><li>Maintain accurate documentation of network configurations, incident response procedures, and troubleshooting steps.</li><li>Provide technical updates and reporting during operational incidents or troubleshooting activities.</li><li>Follow best practices for operational documentation and escalation procedures within the NOC environment.</li><li>Work closely with engineering teams and internal stakeholders to resolve network issues and improve infrastructure reliability.</li><li>Provide technical support and guidance to internal teams and client stakeholders when network incidents occur.</li><li>Stay current on networking technologies, monitoring tools, and operational best practices related to network infrastructure and security.</li></ul><p><br></p><p><br></p>
<p><strong>Software Engineer (Databricks/Data Platform)</strong></p><p><strong>Hybrid 3-4 days onsite in Alpharetta, GA</strong></p><p><strong>Duration through 10/30/26</strong></p><p><br></p><p>We are looking for an experienced Software Engineer III to join our team in Alpharetta, GA. In this role, you will play a critical part in supporting and developing a Databricks-based data platform, focusing on creating scalable and efficient solutions during the development phase. This is a long-term contract position, requiring in-office work three to four days per week.</p><p><br></p><p>Responsibilities:</p><ul><li>Develop and support Databricks notebooks, jobs, and workflows</li><li>Write, optimize, and maintain PySpark and Python code for data processing</li><li>Help design scalable, reliable, and efficient data pipelines</li><li>Apply Spark best practices (partitioning, caching, joins, file sizing)</li><li>Work with Delta Lake tables and data models</li><li>Perform data validation and quality checks during development</li><li>Support cluster configuration and sizing for development workloads</li><li>Identify performance bottlenecks early and recommend improvements</li><li>Collaborate with Data Engineers to ensure solutions are production-ready</li><li>Document development standards, patterns, and best practices</li></ul>
<p><strong>What You Will Own:</strong></p><p><strong>Linux Infrastructure Operations</strong></p><ul><li>Full lifecycle administration of ~222 Linux servers (production, QA, development)</li><li>OS upgrades, patch management, and kernel updates</li><li>Performance monitoring and system tuning (CPU, memory, disk I/O, network)</li><li>User access management and authentication integrations (LDAP/AD)</li><li>Backup validation and disaster recovery readiness</li></ul><p><strong>Kubernetes / OpenShift Platform Ownership</strong></p><ul><li>Deploy, administer, and support <strong>Kubernetes/OpenShift clusters</strong> across environments</li><li>Manage cluster lifecycle: installation, upgrades, patching, and scaling</li><li>Configure and maintain:</li><li>Namespaces, RBAC, and security policies</li><li>Networking (CNI, ingress controllers, load balancing)</li><li>Persistent storage (PVCs, storage classes)</li><li>Support application teams with container deployments, troubleshooting, and performance tuning</li><li>Monitor cluster health using tools like Prometheus, Grafana, and native OpenShift tooling</li><li>Optimize cluster resource utilization and capacity planning</li><li>Implement and maintain CI/CD integrations for containerized workloads</li></ul><p><strong>Security & Hardening</strong></p><ul><li>Implement and maintain patching cadence across Linux and Kubernetes environments</li><li>System hardening aligned to CIS/STIG best practices</li><li>SELinux configuration and enforcement</li><li>Firewall configuration (iptables / firewalld)</li><li>Kubernetes security best practices (RBAC, pod security standards, image scanning)</li><li>Support vulnerability remediation from tools (Tenable, Qualys, etc.)</li><li>Log monitoring and audit review across infrastructure and containers</li></ul><p><strong>Incident Response & Production Stability</strong></p><ul><li>Lead root cause analysis (RCA) for infrastructure and platform incidents</li><li>Participate in on-call support for critical systems and clusters</li><li>Resolve Sev1/Sev2 outages across Linux and Kubernetes environments</li><li>Develop post-incident documentation and preventative controls</li></ul><p><strong>Modernization & Automation</strong></p><ul><li>Assess and remediate deprecated platform components</li><li>Standardize system and cluster configurations</li><li>Build documentation and operational runbooks</li><li>Drive infrastructure-as-code and automation initiatives (Ansible, Terraform, etc.)</li><li>Support migration of legacy workloads to containerized platforms</li></ul><p><strong>UNIX & OS/400 (IBM) Support</strong></p><ul><li>Administer UNIX environments (AIX/Solaris experience preferred)</li><li>Support integrations with IBM i (OS/400) systems supporting Rail operations</li><li>Ensure proper update and lifecycle management across platforms</li><li>Maintain cross-platform data and system dependencies</li></ul>
<p>Robert Half is hiring! We are looking for an experienced Site Reliability Engineer to join our team. This role involves designing, operating, and enhancing a secure, scalable, and cost-efficient multi-cloud platform. The ideal candidate will possess a strong technical background, a passion for automation and observability, and a commitment to improving system reliability and efficiency.</p><p><br></p><p>Responsibilities:</p><p>• Design, implement, and manage reliable and scalable systems across multi-cloud environments, including AWS and Azure.</p><p>• Develop and refine service level objectives (SLOs), service level indicators (SLIs), and error budgets to support system reliability.</p><p>• Lead root cause analyses for incidents and implement measures to prevent recurrence.</p><p>• Enhance platform observability by creating and maintaining metrics, logs, traces, and alerts.</p><p>• Drive cloud cost optimization initiatives by implementing cost visibility, forecasting, and accountability measures.</p><p>• Collaborate with security teams to ensure compliance with regulatory standards and embed security into platform operations.</p><p>• Automate operational workflows using Infrastructure as Code and CI/CD pipelines.</p><p>• Utilize AI tools to improve incident analysis, capacity planning, and operational efficiency.</p><p>• Mentor and guide engineering teams on reliability practices and cost-efficient architectures.</p><p>• Partner with cross-functional teams to influence technical direction and improve operational maturity.</p>
We are looking for an experienced Systems Engineer to join our team in Lawrenceville, Georgia. This role requires a strong background in Citrix technologies, cloud solutions, and Azure Virtual Desktop. The ideal candidate will bring a hands-on approach while also contributing at a senior level to support infrastructure and cloud integration efforts.<br><br>Responsibilities:<br>• Lead the design, implementation, and maintenance of Citrix environments, including Desktop/App, Cloud, and Netscalers.<br>• Manage and optimize Azure Virtual Desktop solutions to enhance system performance and scalability.<br>• Collaborate with IT staff and contractors to ensure seamless infrastructure operations within a robust technical environment.<br>• Oversee Active Directory and Azure Active Directory configurations, ensuring security and reliability.<br>• Administer and troubleshoot Microsoft Windows Server environments to maintain system efficiency.<br>• Integrate Microsoft Exchange services into existing infrastructure, ensuring smooth communication and data management.<br>• Provide strategic guidance and technical expertise to improve cloud integration and infrastructure processes.<br>• Participate in ongoing system assessments to identify areas for improvement and implement solutions.<br>• Act as a senior-level resource for troubleshooting and resolving complex system issues.<br>• Support the transition to a hybrid work setup after the initial onsite period.
<p><strong>Java Developer III</strong></p><p><strong>Based in Alpharetta, GA</strong></p><p><strong>12 month contract - Potential Extensions</strong></p><p><br></p><p>Responsibilities</p><ul><li>Design, build, and maintain Java/Spring Boot microservices handling audio, video, and image uploads</li><li>Integrate backend services with Azure OpenAI and Azure Speech for transcription, text‑to‑speech, and intelligent data extraction</li><li>Implement secure cloud‑based storage and retrieval using Azure Blob Storage</li><li>Optimize system performance, latency, and reliability for both real‑time and batch AI workflows</li><li>Collaborate with frontend teams on media capture, playback APIs, and workflows</li><li>Partner with cloud, security, and governance teams to ensure compliance, security, and data protection</li><li>Participate in end‑to‑end solution ownership from design through deployment and support</li></ul><p><br></p>
<p><strong>Key Responsibilities:</strong></p><p><strong>Solution Design and Development</strong></p><ul><li>Design, develop, and implement customizations, extensions, and integrations within the Dynamics 365 platform to meet client-specific requirements.</li><li>Leverage tools such as X++, C#, JavaScript, Power Platform, Logic Apps, and Azure Functions to build scalable and high-performance solutions.</li><li>Collaborate and ensure development aligns with architectural standards and project goals.</li></ul><p><strong>Code Oversight and Quality Assurance</strong></p><ul><li>Oversee the quality of code developed by the team, conducting code reviews to ensure adherence to best practices, security standards, and scalability requirements.</li><li>Approve and promote code through the development lifecycle, ensuring readiness for deployment.</li><li>Troubleshoot and resolve technical issues, providing guidance to team members to address complex challenges.</li></ul><p><strong>Collaboration with Distributed Teams and Partners</strong></p><ul><li>Collaborate with distributed development resources (onshore/offshore/partners as applicable) to support timely delivery of technical work.</li><li>Maintain clear communication channels across teams to facilitate seamless handoffs, reviews, and resolution of blockers.</li><li>Coordinate expectations for deliverables, timelines, and definition of done, escalating risks to technical leads/project management as needed.</li></ul><p><strong>Collaboration with Project Teams</strong></p><ul><li>Work closely with functional consultants to translate business requirements into technical specifications.</li><li>Support project managers and technical leads in tracking development progress and addressing blockers.</li><li>Participate in design and planning sessions to ensure technical feasibility and alignment with client needs.</li></ul><p><strong>Integration and Customization</strong></p><ul><li>Develop and configure integrations between Dynamics 365 and external systems using APIs, Logic Apps, and Azure services.</li><li>Customize Dynamics 365 modules (Finance, Supply Chain, Customer Engagement, etc.) to align with specific business processes.</li><li>Manage and execute data migrations, ensuring accuracy and security throughout the process.</li></ul><p><strong>Continuous Improvement and Innovation</strong></p><ul><li>Identify opportunities to improve existing codebases and processes to enhance efficiency and maintainability.</li><li>Stay current with the latest Dynamics 365 updates, tools, and best practices, incorporating them into development efforts.</li><li>Contribute to internal knowledge sharing by documenting solutions and providing technical guidance to the team.</li></ul>
We are looking for a Platform Engineer to support and enhance a large-scale Linux and container platform environment for a motor freight forwarder in Johns Creek, Georgia. This Long-term Contract position focuses on maintaining reliable infrastructure operations, strengthening platform performance, and ensuring stable Kubernetes and OpenShift services across multiple environments. The role will partner with technical teams to improve automation, availability, and operational readiness for business-critical systems.<br><br>Responsibilities:<br>• Administer a broad Linux server estate across production, quality assurance, and development environments, ensuring consistent performance and system stability.<br>• Plan and carry out operating system maintenance activities, including version upgrades, routine patching, and kernel updates with minimal service disruption.<br>• Track infrastructure health and fine-tune compute, memory, storage, and network performance to improve reliability and efficiency.<br>• Oversee account access controls and support authentication connectivity with enterprise directory services.<br>• Verify backup integrity and contribute to disaster recovery preparedness through regular review and testing activities.<br>• Manage Kubernetes and OpenShift platforms across environments, including deployment, day-to-day administration, and ongoing support.<br>• Lead cluster lifecycle activities such as installation, expansion, patching, version upgrades, and capacity adjustments.<br>• Configure and maintain core platform components such as namespaces, security settings, ingress, load balancing, and persistent storage resources.<br>• Assist application teams with container onboarding, issue resolution, and workload optimization while supporting CI/CD integration for container-based deployments.<br>• Monitor platform health with tools such as Prometheus, Grafana, and native OpenShift capabilities, and use findings to guide capacity planning.