Software Engineer II - CTJ - Poly
![]() | |
![]() United States, Virginia, Reston | |
![]() | |
OverviewMicrosoft has an exciting opportunity for a Software Engineer II in the Cloud+AI Silver Team. This team is responsible for deploying and operating a Secure Work Area, including infrastructure for collaboration within an airgapped environment. You will work on systems that enable Azure services to be consumed by internal and external customers in highly secured and regulated industries, meeting stringent security and compliance requirements.Our team is collaborative, supportive, and deeply committed to delivering resilient and secure cloud services. We play a critical role in supporting Microsoft's One Plane services within airgapped clouds, including Azure Resource Manager, Azure Resource Graph, Azure Policy, and Machine Configuration. These services form the backbone of resource management and compliance in highly secure environments, and we focus on maintaining and optimizing their reliability and performance through continuous monitoring and proactive issue resolution.This role offers the opportunity to build and operate systems that meet the highest security standards while collaborating with teams across Microsoft to deliver critical services for regulated industries. You'll contribute to innovative solutions that empower customers in secure environments and help shape the future of cloud resiliency.Microsoft's mission is to empower every person and every organization on the planet to achieve more. We come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
ResponsibilitiesThe scale of our operations is enormous. Microsoft's products and services are overwhelmingly consumed online, and billions of people use them every day. We need people who enjoy analyzing complicated problems, coming up with creative solutions, working in focused teams to build things no-one has thought of before, all in the service of production reliability.Act as a Designated Responsible Individual (DRI) on a rotational, on-call basis to monitor services for degradation, downtime, or interruptions. Respond within SLA, alert stakeholders, initiate restoration actions, and drive efforts to reduce incident volume through global resolutions. Contribute to postmortem reviews and share insights for systemic improvements.Develop and maintain automation for deployment and production environments. Validate functionality and reliability by running code in simulated or non-production environments, ensuring error-free runtime and adherence to best practices for scalability and performance.Collect, classify, and analyze telemetry and operational data on system health, reliability, and performance. Create dashboards and notifications, establish feedback loops, and use insights to inform product refinements and engineering decisions.Ensure compliance with Microsoft's security, privacy, and accessibility standards by following established processes and validating evidence of compliance. Maintain awareness of onboarding new technologies and their implications for security and regulatory requirements.Build and enhance developer tools and reusable components to support efficient coding practices and improve engineering workflows. Share best practices, mentor peers on tools and strategies, and leverage open-source solutions where applicable.Apply engineering best practices to design and deliver standardized, repeatable, and scalable solutions that meet customer requirements and performance expectations. Drive consistency in monitoring and operations at scale.Collaborate across teams and maintain live service operations, implementing mitigations for complex issues impacting performance or functionality. Communicate effectively with partners across Microsoft to ensure desirable user experiences and dynamic customer needs are met.Embody our culture and values |