This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
With the continued advancement of breakthrough artificial intelligence models, the Azure Core team is expanding to deliver the foundational platform that will support the next generation of artificial intelligence workloads. The team is hiring experienced engineers to join agile engineering groups within the Host Networking and Hardware Acceleration team. As a Principal Software Engineer in this team, you will contribute to Azure’s host networking infrastructure by designing and deploying agents that integrate hardware and software components to deliver reliable and high-performance networking across current and future Azure hosts. You will also develop servicing software for Network Interface Cards to support artificial intelligence and other Azure workloads. This role offers the opportunity to deepen expertise in service-oriented architecture, distributed systems, low-level programming, operating system internals, virtualization, large-scale infrastructure, performance optimization, hardware offloading, and testing methodologies. The team supports flexible work arrangements, including remote and hybrid models, and values collaboration, technical excellence, and impact across one of the most critical services in the cloud.
Job Responsibility:
Collaborates with appropriate stakeholders to determine user requirements for a scenario
Drives identification of dependencies and the development of design documents for a product, application, service, or platform
Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI)
Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items
Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate
Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale
Requirements:
Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, Rust, C++, or Python
OR equivalent experience
6+ years of experience working in low level systems development (Ex. networking, drivers, etc.)
Ability to meet Microsoft, customer and/or government security screening requirements
Microsoft Cloud Background Check
Nice to have:
Bachelor's Degree in Computer Science OR related technical field AND 10+ years technical engineering experience with coding in languages including, but not limited to, C, Rust, C++, OR Python
OR Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, Rust, C++, OR Python
OR equivalent experience
1+ year(s) experience with virtualization and networking technologies, including: Hyper-V, Kernel-based Virtual Machine (KVM), VMware, and Network Interface Card (NIC) hardware offloads
Microservices, service-oriented architecture, and distributed systems
Data Plane Development Kit (DPDK) and continuous integration/continuous deployment (CI/CD) practices
1+ year(s) experience designing scalable systems with a focus on performance, reliability, and maintainability in cloud environments