CrawlJobs Logo

Senior Manager, Performance AI/ML Network Deployment Engineering

amd.com Logo

AMD

Location Icon

Location:
United States, Santa Clara

Category Icon
Category:
IT - Software Development

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

210400.00 - 315600.00 USD / Year

Job Description:

The Senior Manager, DC GPU Advanced Forward Deployment and Systems Engineering is a leadership position designed to optimize the design, roll-out and post-rollout management of AI/ML Fabrics. The candidate will be the technical interface between the customers and various internal engineering groups, field application engineers Leveraging extensive experience in large network architecture, Storage, AI/ML network deployments, and performance tuning, this role requires a disciplined approach to system triage, at-scale debug, and infrastructure optimization to ensure robust performance and efficient transitions from GPU production qualification to at-scale datacenter deployment.

Job Responsibility:

  • Collaborate with strategic customers on scalable designs involving compute, networking, storage environment, work with industry partners, Internal teams to accelerate the deployment, adoption of various AI/ML models
  • Engage system-level triage and at-scale debug of complex issues across hardware, firmware, and software, ensuring rapid resolution and system reliability
  • Drive the ramp of Instinct-based large scale AI datacenter infrastructure based on NPI base platform hardware with ROCm, scaling up to pod and cluster level, leveraging the best in network architecture for AI/ML workloads
  • Enhance tools and methodologies for large-scale deployments to meet customer uptime goals and exceed performance expectations
  • Engage with clients to deeply understand their technical needs, ensuring their satisfaction with tailored solutions that leverage your past experience in strategic customer engagements and architectural wins
  • Provide domain specific knowledge to other groups at AMD, share the lessons learnt to drive continuous improvement
  • Engage with AMD product groups to drive resolution of application and customer issues
  • Develop and present training materials to internal audiences, at customer venues, and at industry conferences

Requirements:

  • Expertise in networking and performance optimization for large-scale AI/ML networks, including network, compute, storage cluster design, modelling, analytics, performance tuning, convergence, scalability improvements
  • Prefer candidates with solid, hands-on expertise in at least one or more of 3 domains, namely compute, network, storage
  • Experience in working with large customers such as Cloud Service Providers and global enterprise customers
  • Proven leadership in engaging customers with diverse technical disciplines in avenues such as Proof of Concept, Competitive evaluations, Early Field Trials etc
  • Direct experience in working with large customers and can operate with sense of urgency, own the problems and resolve it
  • Demonstrated leadership in network architecture, hands on experience in RoCEv2 Design, VXLAN-EVPN, BGP, and Lossless Fabrics
  • Proven ability to influence design and technology roadmaps, leveraging a deep understanding of datacenter products and market trends
  • Extensive hands-on Network deployment expertise and proven track record of delivering large projects on time. Cisco, Juniper or Arista experience is preferred
  • Direct, co-development/deployment experience in working with strategic customers/partners in bringing solutions to market
  • Excellent communication level from engineer to mid-management to C-level of audience
  • Bachelors, master's in computer science, Engineering or related subjects of experience
  • This is a Senior level role
  • no recent college graduates will be considered
  • Ability to work well in a geographically dispersed team
  • Certifications in Networking, AI/ML, or Cloud Technologies

Additional Information:

Job Posted:
December 17, 2025

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior Manager, Performance AI/ML Network Deployment Engineering

Senior Cloud Infrastructure Security Engineer

Truveta is the world’s first health provider led data platform with a vision of ...
Location
Location
United States , Seattle; Bellevue
Salary
Salary:
135000.00 - 180000.00 USD / Year
truveta.com Logo
Truveta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A minimum bachelor’s in Computer Science, Software Engineering, Electrical or Electronics Engineering, Information Systems, or equivalent
  • 5+ years’ experience in public cloud networking & security design, implementation & support
  • Experience of TCP/IP IPv4/v6, office network (Routing/Switching/WAN, Wi-Fi & Security) management
  • 3+ years automation experience in Azure Cloud Networking / Azure DevOps or GitHub CI/CD pipelines in any of the following: Python, PowerShell, Terraform, Bicep, YAML template
  • 3+ years network security practices in on-premises and/or cloud environment
  • Experience managing and supporting Windows Desktop OS, MacOS, managed endpoint administration at scale across an enterprise sized environment
  • Understanding of the Windows Desktop/Mac OS packaging, scripting, and automated deployment tools, such as Microsoft Intune and Jamf.
  • Ability to participate in on-call rotation
Job Responsibility
Job Responsibility
  • Design and implement Azure cloud-based infrastructure, including using tools for infrastructure as code(IaC) and automation to meet technical, security and business needs.
  • Design and implement Azure cloud environments (tenant, subscription, VM, storage account, databases, networking, firewalling) optimized for AI/ML workloads.
  • Manage and maintain Azure Networking, Azure firewalls/VPN and associated policies/rules, Web Application Firewall, Application Gateway, Front Door, VNET peering, ensuring security, availability, scalability, and performance.
  • Secure Azure Kubernetes clusters, containers, and images.
  • Establish and enforce Azure security policies, manage access controls, and ensure the infrastructure complies with relevant regulations.
  • Automate tenant and infrastructure provisioning, deployments, and other routine tasks to increase efficiency.
  • Monitor Azure cloud resources, analyze performance, and troubleshoot issues as they arise.
  • Perform incident troubleshoot and problem resolution for office network, cloud infrastructure, and own postmortems.
  • Work with Engineering teams and external teams, gather requirements, develop and integrate cloud solutions and support business needs.
  • Actively participate in architecture, code reviews, presentations, share learns and best practices to enable flawless deployment and quality operations.
What we offer
What we offer
  • Interesting and meaningful work for every career stage
  • Great benefits package
  • Comprehensive benefits with strong medical, dental and vision insurance plans
  • 401K plan
  • Professional development & training opportunities for continuous learning
  • Work/life autonomy via flexible work hours and flexible paid time off
  • Generous parental leave
  • Regular team activities (virtual and in-person)
  • Additional compensation such as incentive pay and stock options (for certain roles)
  • Fulltime
Read More
Arrow Right

Senior DevOps Engineer (GCP)

Our client is a global UK-based financial services and investment banking organi...
Location
Location
Salary
Salary:
Not provided
n-ix.com Logo
N-iX
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in DevOps, Cloud Engineering, or SRE roles
  • Strong hands-on experience with Google Cloud Platform, including: GKE / Kubernetes, Cloud Run, Cloud Functions, Pub/Sub, Cloud Storage, VPC, IAM, networking, security
  • Expertise in Terraform, Helm, or other IaC tools
  • Experience building CI/CD pipelines (GitHub Actions, GitLab CI, CircleCI, Jenkins, etc.)
  • Strong understanding of containerization and orchestration: Docker, Kubernetes
  • Solid experience with monitoring, observability, and logging stacks
  • Familiarity with networking, load balancing, security hardening, and zero-trust principles
  • Experience supporting production systems in high-availability, distributed environments
  • Strong scripting skills (Python, Bash, or similar)
  • Experience working with agile engineering teams
Job Responsibility
Job Responsibility
  • Design, implement, and maintain cloud infrastructure on Google Cloud (GKE, Cloud Run, Cloud Functions, Pub/Sub, Cloud Storage)
  • Build and optimize CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, or similar)
  • Develop infrastructure-as-code using Terraform or similar tools
  • Set up and maintain container orchestration (Kubernetes, GKE) and automated deployment workflows
  • Implement monitoring, alerting, and observability using tools such as Prometheus, Grafana, ELK/Elastic, Stackdriver, or OpenTelemetry
  • Ensure compliance with security and governance standards across all environments
  • Collaborate closely with engineering teams to ensure scalable, high-performance deployment architectures
  • Support AI/ML and GenAI workloads (Vertex AI pipelines, model hosting, GPU workloads, inference optimization)
  • Manage environment strategies, release pipelines, configuration management, and secrets management
  • Optimize cloud costs and recommend improvements for performance and reliability
What we offer
What we offer
  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits
Read More
Arrow Right

Senior Devops & AI Engineer

This role presents a unique opportunity to contribute to the future of impactful...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
fissionlabs.com Logo
Fission Labs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Engineering, or related field
  • 6+ years of experience in Infrastructure Mgmt. roles, with a focus on cloud platforms (Azure and AWS Preferred)
  • Hands-on experience with operations (DevSecOps) principles and best practices
  • Proficiency in scripting languages such as Python, PowerShell, or Bash
  • Excellent communication and collaboration skills
  • In-depth knowledge of Linux operating systems, including CentOS, Ubuntu, and Red Hat, with expertise in shell scripting, package management, and system administration
  • Hands-on experience with a wide range of AWS and Azure services
  • Develop and maintain Infrastructure as Code (IAC) templates using tools such as Terraform or AWS CloudFormation
  • Experience setting up cloud infrastructure stack, databases, service endpoints, GPU as well as CPU resource scaling, optimization etc.
  • Should have worked AIOps/MLOP
Job Responsibility
Job Responsibility
  • Configure and optimize Linux-based servers for performance, security, and resource utilization, including kernel tuning, file system management, and network configuration
  • Architect cloud solutions leveraging best practices and services offered by AWS and Azure, optimizing for scalability, reliability, and cost-effectiveness
  • Implement and manage hybrid cloud environments, facilitating seamless integration and interoperability between AWS and Azure services
  • Establish version control practices for IAC templates, ensuring traceability, auditability, and reproducibility of infrastructure changes
What we offer
What we offer
  • Opportunity to work on impactful technical challenges with global reach
  • Vast opportunities for self-development, including online university access and knowledge sharing opportunities
  • Sponsored Tech Talks & Hackathons to foster innovation and learning
  • Generous benefits packages including health insurance, retirement benefits, flexible work hours, and more
  • Supportive work environment with forums to explore passions beyond work
  • Fulltime
Read More
Arrow Right

Engineering Director

We are seeking a seasoned Engineering Director who thrives in challenging and fa...
Location
Location
Puerto Rico , Aguadilla
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Significant work experience as a director or similar position working across multiple stakeholder organizations, with at least 10+ years of people leadership experience specific to SW and Cloud engineering
  • Solid experience leading SW development across storage, networking, on-prem, and SaaS is a must
  • Experience in setting up geographically distributed sites
  • Must have a strong background in software development lifecycle including cloud infrastructure
  • Familiarity with agile methodologies and tools like JIRA
  • Prior experience in cloud product development and deployments
  • end to end ownership and accountability
  • Solid understanding of fundamental AI and machine learning concepts, including supervised and unsupervised learning, deep learning, reinforcement learning, natural language processing, computer vision, and statistical modeling
  • Extensive business acumen, technical knowledge, and industry experience encompassing one or more engineering, technology, and product domains
  • Demonstrated abilities to drive transformation across a business with exceptional skills in the management of change
Job Responsibility
Job Responsibility
  • Oversee the Puerto Rico Site daily operations, strategic planning and cross-functional team leadership for Hybrid Cloud
  • Recruit, mentor, and manage teams of AI/ML engineers, QA Engineers, Design Engineers and innovation specialists to deliver cutting-edge solutions
  • Continuously evaluate new tools, platforms, and frameworks in AI/ML to drive competitive advantage and operational efficiency
  • Ensure alignment with corporate goals while fostering a high-performance culture, operational efficiency, and employee engagement
  • Lead the development and execution of AI/ML strategies that align with business goals and drive innovation across products, services, or operations
  • Create strategic and tactical operations and resource plans, goals, and priorities for assigned organization based on business and technology roadmap and functional objectives
  • Engage with various senior leaders across the organization, program managers, R&D, support, Quality, product managers, technical leaders and executives to communicate program status, escalate issues, and guide and influence strategic decision-making
  • Manage senior relationships and escalated issues with outsourced partners and suppliers, including setting expectations regarding deliverables, product quality, schedules, and costs
  • ensures that organization is effectively leveraging outsourced resources
  • Identify opportunities for and drive organizational initiatives and programs to support business process improvements and cost reductions
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right
New

Account Executive

We’re hiring an Account Executive to support a busy team working with well-known...
Location
Location
United Kingdom , London; Reading; Surrey; Uxbridge
Salary
Salary:
25000.00 - 30000.00 GBP / Year
asginternational.co.uk Logo
ASG International
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience working directly with clients in a fast-moving environment
  • Great attention to detail and a proactive, solution-focused mindset
  • Ability to juggle multiple projects with confidence
  • Strong communication and relationship-building skills
  • Commercial awareness and a good understanding of deadlines and budgets
  • A genuine interest in marketing, print, and creative production
Job Responsibility
Job Responsibility
  • Support the Account Manager with day-to-day project delivery
  • Take client briefs, capture key requirements, and make sure deadlines are met
  • Manage orders and coordinate with suppliers to ensure smooth production
  • Provide technical input and help identify smart, efficient solutions
  • Monitor progress and make sure every project is delivered on time and to spec
  • Keep all activity recorded accurately using internal systems
  • Build great relationships with clients, suppliers, and internal teams
  • Ensure invoicing and admin are completed correctly
  • Deliver consistently high levels of service and communication
  • Fulltime
Read More
Arrow Right
New

Account Manager – Promotional Merchandise

Join an innovative leader in retail marketing services. Who specialise in print,...
Location
Location
United Kingdom , West London
Salary
Salary:
35000.00 - 40000.00 GBP / Year
asginternational.co.uk Logo
ASG International
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Account/Project/Production Management: Hands-on experience in promotional merchandise
  • Sourcing Expertise: Ability to source and manage products
  • Industry Experience: FMCG experience is preferred
  • Eco Marketing Passion: A commitment to sustainable and eco-friendly marketing materials
Job Responsibility
Job Responsibility
  • Oversee the entire lifecycle of promotional merchandise projects, from the initial brief to final delivery
  • Collaborating with internal and external stakeholders
  • Ensuring projects are completed on time, within budget, and to the highest quality
  • Crafting solutions for in-store marketing campaigns
  • Applying design principles to your projects
  • Fulltime
Read More
Arrow Right
New

Senior Account Manager

Joining our client as Senior Account Manager opens the door to a dynamic role wh...
Location
Location
United Kingdom , Manchester
Salary
Salary:
35000.00 - 45000.00 GBP / Year
asginternational.co.uk Logo
ASG International
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience in Marketing Print and/Or POS
  • Experienced leader who can balance both their own daily work as well as assist with the wider team
  • Varied experience of clients from varying industries
  • Experience of delivering client relationships to time and budget
Job Responsibility
Job Responsibility
  • Manage projects and foster client and supplier relationships
  • Understand clients’ marketing needs and navigate environmental parameters
  • Handle intricate projects from start to finish, ranging from Marketing, Print, Retail POS as well as branded merchandise
  • Own the project lifecycle
  • Achieve growth targets
  • Support and guide the team on project bases
  • Initiate new projects and oversee existing ones
  • Schedule meetings and ensure stakeholders and suppliers stay informed about critical decision points
  • Fulltime
Read More
Arrow Right
New

Account Manager

Account Manager – Promotional Merchandise. We’re looking for an Account Manager ...
Location
Location
United Kingdom , Brentwood
Salary
Salary:
35000.00 GBP / Year
asginternational.co.uk Logo
ASG International
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Industry experience – Familiarity with the full product lifecycle of promotional items such as lanyards, mugs, pens, apparel, tech products, and stationery
  • Brand connections – Experience working with global brands in Retail, Fashion, or FMCG is a plus
  • A proactive mindset – self-starter who enjoys autonomy but thrives in a collaborative, fast-paced environment
Job Responsibility
Job Responsibility
  • Manage projects for leading blue-chip clients
What we offer
What we offer
  • Work with major brands
  • Creative & growing company
  • Sustainability & innovation
  • Career growth – Opportunities to take ownership of projects, grow into leadership, and collaborate across departments
  • Modern workplace – We embrace the latest tech, promote equality, and support a flexible work culture
Read More
Arrow Right
Welcome to CrawlJobs.com
Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.