CrawlJobs Logo

Sr. Deployment Engineer, AI Inference

cerebras.net Logo

Cerebras Systems

Location Icon

Location:
United States; Canada , Sunnyvale

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. We are seeking a highly skilled and experienced Sr. Deployment Engineer to build and operate our cutting-edge inference clusters. These clusters would provide the candidate an opportunity to work with the world's largest computer chip, the Wafer-Scale Engine (WSE), and the systems that harness its unparalleled power. You will play a critical role in ensuring reliable, efficient, and scalable deployment of AI inference workloads across our global infrastructure.

Job Responsibility:

  • Deploy AI inference replicas and cluster software across multiple datacenters
  • Operate across heterogeneous datacenter environments undergoing rapid 10x growth
  • Maximize capacity allocation and optimize replica placement using constraint-solver algorithms
  • Operate bare-metal inference infrastructure while supporting transition to K8S-based platform
  • Develop and extend telemetry, observability and alerting solutions to ensure deployment reliability at scale
  • Develop and extend a fully automated deployment pipeline to support fast software updates and capacity reallocation at scale
  • Translate technical and customer needs into actionable requirements for the Dev Infra, Cluster, Platform and Core teams
  • Stay up to date with the latest advancements in AI compute infrastructure and related technologies.

Requirements:

  • 5-7 years of experience in operating on-prem compute infrastructure (ideally in Machine Learning or High-Performance Compute) or in developing and managing complex AWS plane infrastructure for hybrid deployments
  • Strong proficiency in Python for automation, orchestration, and deployment tooling
  • Solid understanding of Linux-based systems and command-line tools
  • Extensive knowledge of Docker containers and container orchestration platforms like K8S
  • Familiarity with spine-leaf (Clos) networking architecture
  • Proficiency with telemetry and observability stacks such as Prometheus, InfluxDB and Grafana
  • Strong ownership mindset and accountability for complex deployments
  • Ability to work effectively in a fast-paced environment.
What we offer:
  • Build a breakthrough AI platform beyond the constraints of the GPU
  • Publish and open source their cutting-edge AI research
  • Work on one of the fastest AI supercomputers in the world
  • Enjoy job stability with startup vitality
  • Our simple, non-corporate work culture that respects individual beliefs.

Additional Information:

Job Posted:
February 17, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Sr. Deployment Engineer, AI Inference

Sr. Distinguished AI Engineer

At Capital One, we are creating responsible and reliable AI systems, changing ba...
Location
Location
United States , Cambridge, Massachusetts; New York, New York; Richmond, Virginia; San Jose, California; McLean, Virginia; San Francisco, California
Salary
Salary:
280600.00 - 384200.00 USD / Year
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 10 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 8 years of experience developing AI and ML algorithms or technologies
  • At least 10 years of experience programming with Python, Go, Scala, or Java
Job Responsibility
Job Responsibility
  • Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products
  • Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability
  • Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more
  • Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large scale production AI systems
  • Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One
What we offer
What we offer
  • comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being
  • performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • Fulltime
Read More
Arrow Right

Sr. Engineer, ML Platform

As the leading delivery platform in the region, we have a unique responsibility ...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
deliveryhero.com Logo
Delivery Hero
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong software engineering background with experience in building distributed systems or platforms designed for machine learning and AI workloads
  • Expert-level proficiency in Python and familiarity with ML frameworks (TensorFlow, PyTorch), infrastructure tooling (MLflow, Kubeflow, Ray), and popular APIs (Hugging Face, OpenAI, LangChain)
  • Experience implementing modern MLOps practices, including model lifecycle management, CI/CD, Docker, Kubernetes, model registries, and infrastructure-as-code tools (Terraform, Helm)
  • Demonstrated experience working with cloud infrastructure, ideally AWS or GCP, including Kubernetes clusters (GKE/EKS), serverless architectures, and managed ML services (e.g., Vertex AI, SageMaker)
  • Proven experience with generative AI technologies: transformers, embeddings, prompt engineering strategies, fine-tuning vs. prompt-tuning, vector databases, and retrieval-augmented generation (RAG) systems
  • Experience designing and maintaining real-time inference pipelines, including integrations with feature stores, streaming data platforms (Kafka, Kinesis), and observability platforms
  • Familiarity with SQL and data warehouse modeling
  • capable of managing complex data queries, joins, aggregations, and transformations
  • Solid understanding of ML monitoring, including identifying model drift, decay, latency optimization, cost management, and scaling API-based genAI applications efficiently
  • Bachelor’s degree in Computer Science, Engineering, or a related field
Job Responsibility
Job Responsibility
  • Design, build, and maintain scalable, reusable, and reliable ML platforms and tooling that support the entire ML lifecycle, including data ingestion, model training, evaluation, deployment, and monitoring for both traditional and generative AI models
  • Develop standardized ML workflows and templates using MLflow and other platforms, enabling rapid experimentation and deployment cycles
  • Implement robust CI/CD pipelines, Docker containerization, model registries, and experiment tracking to support reproducibility, scalability, and governance in ML and genAI
  • Collaborate closely with genAI experts to integrate and optimize genAI technologies, including transformers, embeddings, vector databases (e.g., Pinecone, Redis, Weaviate), and real-time retrieval-augmented generation (RAG) systems
  • Automate and streamline ML and genAI model training, inference, deployment, and versioning workflows, ensuring consistency, reliability, and adherence to industry best practices
  • Ensure reliability, observability, and scalability of production ML and genAI workloads by implementing comprehensive monitoring, alerting, and continuous performance evaluation
  • Integrate infrastructure components such as real-time model serving frameworks (e.g., TensorFlow Serving, NVIDIA Triton, Seldon), Kubernetes orchestration, and cloud solutions (AWS/GCP) for robust production environments
  • Drive infrastructure optimization for generative AI use-cases, including efficient inference techniques (batching, caching, quantization), fine-tuning, prompt management, and model updates at scale
  • Partner with data engineering, product, infrastructure, and genAI teams to align ML platform initiatives with broader company goals, infrastructure strategy, and innovation roadmap
  • Contribute actively to internal documentation, onboarding, and training programs, promoting platform adoption and continuous improvement
  • Fulltime
Read More
Arrow Right
New

Senior inside sales account representative

Calling all Inside Sales professionals in Ottawa! We are working with a global p...
Location
Location
Canada , Ottawa
Salary
Salary:
22.50 CAD / Hour
https://www.randstad.com Logo
Randstad
Expiration Date
April 06, 2026
Flip Icon
Requirements
Requirements
  • Must have 2 years of Sales experience- relationship sales, ad sales, digital marketing
  • Must have 4 years of work experience in total
  • Educational requirement: minimum high school diploma or equivalency
  • Flexible and quick learner, able to adapt to continuously evolving needs to help clients grow their business
  • Highly motivated with strong attention to detail
  • Excellent listening, interpersonal, with a solutions mindset and passion for customer satisfaction
Job Responsibility
Job Responsibility
  • Manage a portfolio of high spending advertisers of small to medium-sized businesses to grow revenue with higher product savviness
  • Research and discover fundamental understanding of vertical(s)/market(s) industry and client business models
  • Identify sales opportunities and provide clients with tailored solution(s) to meet their goal(s) ...
  • Personalize client recommendations to suit varying needs, using vertical expertise
  • Improve clients’ experiences by consulting on optimal and appropriate product adoption
  • Proactively outreach (outbound calls) and reactively engage with clients over the phone and via email to frequently assess product performance and provide insights to optimize their advertisements and increase their investments
  • Demonstrate ability to build relationships with the creation of long-term marketing and sales strategies.
  • Understanding customers’ business and advertising goals to capture and build market intelligence.
  • Develop expert working knowledge of Social Media advertising solutions (including measurement) and what impacts performance, identify trends and solve problems
What we offer
What we offer
  • Bonus within 3 months
  • 6-8 weeks of hands on training and support (Monday- Friday 9:00 am-5:30 pm EST)
  • Opportunity to join a global professional service company, geared towards supporting businesses with optimizing operations and leveraging technology
  • Fulltime
Read More
Arrow Right
New

Shift Supervisor

We’re building a world of health around every individual — shaping a more connec...
Location
Location
United States , Enola
Salary
Salary:
16.50 - 24.00 USD / Hour
https://www.cvshealth.com/ Logo
CVS Health
Expiration Date
April 25, 2026
Flip Icon
Requirements
Requirements
  • Deductive reasoning ability, analytical skills and computer skills
  • Advanced communication skills and supervision skills
  • Ability to work a flexible schedule, including some early morning, overnight and weekend shifts, to work overtime as needed, and to respond to urgent issues at the store when they arise
Job Responsibility
Job Responsibility
  • Work effectively with store management and store crews
  • Supervise the store’s crew through assigning, directing and following up of all activities
  • Effectively communicate information both to and from store management and crews
  • Assist customers with their questions, problems and complaints
  • Promote CVS customer service culture. (Greet, offer help, and thank)
  • Handle all customer relations issues in accordance with company policy and promote a positive shopping experience for all CVS customers
  • Maintain customer/patient confidentiality
  • Price merchandise
  • Stock shelves
  • Execute the displays, sign and inventory of weekly, promotional, and seasonal merchandise
What we offer
What we offer
  • Affordable medical plan options
  • 401(k) plan (including matching company contributions)
  • Employee stock purchase plan
  • No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs, confidential counseling and financial coaching
  • Paid time off
  • Flexible work schedules
  • Family leave
  • Dependent care resources
  • Colleague assistance programs
  • Tuition assistance
  • Parttime
Read More
Arrow Right
New

Facilities Maintenance Technician

Under limited supervision, the Facilities Maintenance Technician is responsible ...
Location
Location
United States , Birmingham
Salary
Salary:
Not provided
allianceautomotive.co.uk Logo
Alliance Automotive UK LV Ltd
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • High school diploma or GED
  • At least five (5) years of commercial maintenance experience or an equivalent combination
  • Knowledge of plumbing, electrical, HVAC, and general maintenance practices
  • Ability to diagnose and fix problems efficiently
  • Ability to perform physically demanding tasks
  • Ability to operate equipment safely
Job Responsibility
Job Responsibility
  • Performing routine maintenance
  • Making repairs to plumbing, electrical, HVAC systems, and general building structures
  • Inspecting buildings and systems
  • Responding to maintenance requests
  • Maintaining inventory
  • Ensuring safety compliance
  • Coordination with contractors and other departments
  • Documentation of inspections, repairs, and maintenance activities
  • Problem-solving
  • Communication with employees, staff, and contractors
What we offer
What we offer
  • Healthcare coverage options
  • 401(k)
  • Tuition reimbursement
  • Vacation pay
  • Sick pay
  • Holiday pay
  • Fulltime
Read More
Arrow Right
New

Pharmacy Intern

You’ve invested a lot of time and energy in your education. Now you want the cha...
Location
Location
United States , Kannapolis
Salary
Salary:
18.00 - 19.75 USD / Hour
https://www.cvshealth.com/ Logo
CVS Health
Expiration Date
March 01, 2026
Flip Icon
Requirements
Requirements
  • Accepted into or actively enrolled in an ACPE accredited college or school of pharmacy
  • 0-3 years relevant work experience
  • Must possess or be in process of obtaining valid intern and/or technician licensure as required
  • Strong communication and presentation skills
  • Complete all required training within state guidelines and required timeframe
  • Ability to: Have regular and predictable attendance, including nights and weekends
  • Be mobile and remain upright for extended periods of time
  • Lift, scan, and bag items
  • Finger Dexterity: Picking, pinching, typing or otherwise working primarily with fingers rather than whole hand or arm
  • Reach overhead
Job Responsibility
Job Responsibility
  • Patient Safety
  • Pharmacy Professional Practice
  • Regulatory Requirements
  • Quality Assurance
  • Customer Service
  • Inventory Management
  • Workflow Management excluding final prescription verification
  • Lead with Heart – display empathy and compassion for your patients, customers, caregivers, and colleagues on your team
  • Seek new ways to grow, collaborate with others, and deliver better outcomes
  • Align others around our purpose to bring your heart to every moment of your health and gain support and commitment
!
Read More
Arrow Right
New

Solutions Engineer - LATAM

Intercom is looking for an exceptional Solutions Engineer to join our vibrant Cu...
Location
Location
United States , San Francisco
Salary
Salary:
163000.00 - 215000.00 USD / Year
intercom.com Logo
Intercom
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum of 4 years of experience in a technical pre-sales role, managing C-Level technical and business relationships
  • Portuguese Fluency required
  • Spanish also preferred
  • Strong technical acumen with experience in conducting technical discovery and delivering high-impact value presentations
  • Ability to solve problems independently while thriving in collaborative team environments
  • Proven time management skills in a dynamic team environment
  • Demonstrated ability to quickly identify and communicate the value proposition throughout the sales cycle
  • Leverage your skills in translating complex business challenges into tailored Intercom solutions, effectively communicating with both technical and non-technical audiences to drive understanding and impact
  • Experience working closely with product and engineering teams to communicate customer feedback and influence product direction
  • Familiarity with managing POCs, RFPs, and addressing complex security questions
Job Responsibility
Job Responsibility
  • Lead technical discovery to identify and address our customers’ needs
  • Deliver an exceptional pre-sales experience by articulating Intercom's value and technical expertise
  • Conduct impactful, value-based solution reviews and in-depth technical sessions
  • Design and lead tailored Proof of Concepts (POCs) that showcase Intercom's capabilities
  • Serve as the primary resource for RFPs and customer security questions, utilizing standardized materials and escalating complex issues
  • Collaborate cross-functionally with Product and Engineering to represent the customer voice, gathering feedback and insights for product planning
  • Build trust with customers by approaching challenges with empathy and curiosity
  • Continuously improve processes and contribute to building a best-in-class Solutions Engineering playbook at Intercom
  • Commit to customer success, ensuring lasting value and proactively addressing challenges
What we offer
What we offer
  • Competitive salary and meaningful equity
  • Comprehensive medical, dental, and vision coverage
  • Regular compensation reviews - great work is rewarded!
  • Flexible paid time off policy
  • Paid Parental Leave Program
  • 401k plan & match
  • In-office bicycle storage
  • Fun events for Intercomrades, friends, and family!
  • Fulltime
Read More
Arrow Right
New

Solutions Engineer - LATAM

Intercom is looking for an exceptional Solutions Engineer to join our vibrant Cu...
Location
Location
United States , Chicago
Salary
Salary:
146750.00 - 193500.00 USD / Year
intercom.com Logo
Intercom
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum of 4 years of experience in a technical pre-sales role, managing C-Level technical and business relationships
  • Portuguese Fluency required
  • Spanish also preferred
  • Strong technical acumen with experience in conducting technical discovery and delivering high-impact value presentations
  • Ability to solve problems independently while thriving in collaborative team environments
  • Proven time management skills in a dynamic team environment
  • Demonstrated ability to quickly identify and communicate the value proposition throughout the sales cycle
  • Leverage your skills in translating complex business challenges into tailored Intercom solutions, effectively communicating with both technical and non-technical audiences to drive understanding and impact
  • Experience working closely with product and engineering teams to communicate customer feedback and influence product direction
  • Familiarity with managing POCs, RFPs, and addressing complex security questions
Job Responsibility
Job Responsibility
  • Lead technical discovery to identify and address our customers’ needs
  • Deliver an exceptional pre-sales experience by articulating Intercom's value and technical expertise
  • Conduct impactful, value-based solution reviews and in-depth technical sessions
  • Design and lead tailored Proof of Concepts (POCs) that showcase Intercom's capabilities
  • Serve as the primary resource for RFPs and customer security questions, utilizing standardized materials and escalating complex issues
  • Collaborate cross-functionally with Product and Engineering to represent the customer voice, gathering feedback and insights for product planning
  • Build trust with customers by approaching challenges with empathy and curiosity
  • Continuously improve processes and contribute to building a best-in-class Solutions Engineering playbook at Intercom
  • Commit to customer success, ensuring lasting value and proactively addressing challenges
What we offer
What we offer
  • Competitive salary and meaningful equity
  • Comprehensive medical, dental, and vision coverage
  • Regular compensation reviews - great work is rewarded!
  • Flexible paid time off policy
  • Paid Parental Leave Program
  • 401k plan & match
  • In-office bicycle storage
  • Fun events for Intercomrades, friends, and family!
  • All regular employees may also be eligible for the corporate bonus program or a sales incentive (target included in OTE) as well as stock in the form of Restricted Stock Units (RSUs)
  • Fulltime
Read More
Arrow Right