This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are looking for a Senior Software Engineer to join our team to drive all aspects of AI feature fundamentals for one of the biggest modern collaboration platforms in the world - Microsoft Teams. We help feature teams ship quality AI experiences out of the gate, track key performance and reliability metrics for critical high-volume scenarios, aid feature teams in improving the debuggability of AI scenarios, help create offline and online evals for all AI features by incorporating into release pipelines and drive culture of performance by promoting best practices and consulting. As a team, we obsess about learning, diving deep into areas of opportunities, experimenting and use an evidence-based approach to turning opportunities into positive impact on performance of the product through collaboration. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Job Responsibility:
Build and evolve the Real-Time Intelligence evaluations platform: implement offline and online eval pipelines, including golden datasets, human review workflows, and LLM-as-judge / auto-raters for agents, anomaly detectors, and decisioning systems
Instrument agentic solutions for observability by wiring up telemetry, tracing, structured logging, and dashboards so quality, safety, latency, and cost are easy to monitor and debug
Integrate evals into the development lifecycle by connecting pipelines to CI/CD, canary and A/B experiments, and phased rollouts, making it simple for partner teams to run and interpret evaluations
Collaborate and mentor across product, research, and engineering teams, sharing best practices on eval design, LLM-as-judge usage, and Responsible AI, and providing code reviews and guidance that raise the bar for the AI features
Requirements:
Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
2+ years of experience on engineering tooling or eval development
1+ years experience in driving fundamentals for AI features within web apps
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Nice to have:
Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Prior experience in working on services at scale
Understanding of building engineering tools on the server side for scale
Prior experience in working closely with AI feature teams and improving fundamentals like performance and reliability is a major plus
Experience solving challenging problems and cross team/organization collaboration skills
Proficiency with React is a plus
Curiosity to dive deep, continuously learn and experiment