<p><strong>Overview:</strong></p><p>We are seeking an experienced AI Network Engineer to support and optimize high-performance infrastructure powering AI/ML workloads. This role focuses on designing and maintaining GPU-accelerated environments leveraging NVIDIA technologies, high-throughput networking, and low-latency architectures.</p><p><br></p><p><strong>Key Responsibilities:</strong></p><ul><li>Design, implement, and support <strong>high-performance networks for AI/ML workloads</strong>, including GPU clusters and distributed training environments</li><li>Deploy and optimize <strong>NVIDIA-based infrastructure</strong> (DGX systems, HGX platforms, or GPU clusters)</li><li>Configure and manage <strong>high-speed networking technologies</strong> such as InfiniBand, RoCE, and 100/200/400Gb Ethernet</li><li>Optimize <strong>network performance for east-west traffic</strong>, low latency, and large data throughput required for AI model training</li><li>Integrate <strong>NVIDIA software stack</strong> (CUDA, NCCL, GPU Cloud, AI Enterprise) with networking and compute environments</li><li>Troubleshoot performance bottlenecks across <strong>network, storage, and GPU interconnects</strong></li><li>Collaborate with AI/ML engineers to ensure infrastructure meets training and inference demands</li><li>Support automation and infrastructure-as-code initiatives for scalable AI environments</li></ul><p><br></p>