AI Performance Engineer
![]() | |
![]() United States, California, San Jose | |
![]() 170 W Tasman Dr (Show on map) | |
![]() | |
The application window is expected to close on: Aug 31, 2025 NOTE: Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received. Meet the Team The Cisco UCS Performance team optimizes and benchmarks the performance of UCS products including AI servers and X-series & C-series servers. Join us as a highly motivated and driven Technical Marketing Engineer to define, validate, and drive compute & AI performance on UCS. Your Impact The Cisco UCS Compute BU is looking for a Technical Marketing Engineer (TME) for Compute platform performance with specific expertise in AI server performance, liquid cooling platform testing and industry standard AI benchmarking like MLPerf Inferencing & Training. The right candidate will have extensive hands-on experience in performance testing with these platforms (NVIDIA DGX/HGX platforms) with specific knowledge of liquid cooling technology. This is an onsite role at Sanjose. Role also includes the following: *Conduct performance tests on UCS AI platforms. *Use industry-standard benchmarking tools (e.g., MLPerf, GenAI perf) to evaluate system performance. *In-depth analysis and evaluation of AI Servers with Liquid cooling technologies. *Provide technical guidance to improve UCS performance specific to liquid cooling environments. *Build performance benchmarks, analyze results, and develop technical marketing materials (white papers, standard process guides, presentations). *Solve Cisco UCS AI server related performance issues. *Collaborate with engineering, product management, and sales teams to ensure industry-leading UCS performance on AI servers. *Support Cisco Technical support for performance debugging and optimizations. *Must be a self-starter and a teammate. *Ability to set and meet timelines. Minimum Qualifications * Bachelor's or master's degree with 7+ years' experience in Hardware Engineering, Performance Engineering or similar engineering roles. *Experience with Performance Evaluation of AI platforms using various profiling and benchmarking tools, including but not limited to analyzing computational efficiency, latency, throughput, and resource utilization. *2+ years of experience with multiple GPU technology/architectures such as NVIDIA HGX/DGX & AMD instinct accelerators. *4+ years of experience working with Liquid cooling technologies such as Direct to chip, Immersion cooling. *Experience working with cluster scale testing on AI infrastructure or optimizing UCS compute & AI infrastructure to support high-performance AI/ML training and inference. Preferred Qualifications *Understanding of high-performance computing AI workloads, and other AI infrastructure components. *Collaborate with hardware and software teams to optimize system configurations for AI platform performance. *Familiarity with deep learning frameworks (TensorFlow, PyTorch, etc.) and performance optimization. *Machine Learning/AI Knowledge: Understanding of machine learning models, neural networks, and deep learning architectures. *Identify performance bottlenecks and implement optimizations to improve training & inference models, reduce latency, and optimize resource usage. *Strong analytical and problem-solving skills with the ability to diagnose and address performance issues in complex systems *Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams. #Compute2025 Why Cisco? At Cisco, we're revolutionizing how data and infrastructure connect and protect organizations in the AI era - and beyond. We've been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint. Simply put - we power the future. Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you'll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere. We are Cisco, and our power starts with you. |