Senior Network Engineer - CTJ - TS/SCI
Microsoft
United States, Virginia, Reston
Jan 11, 2025
Overview
Microsoft has an exciting opportunity for a Senior Network Engineer to join the Azure Silver and Sovereign Team as part of the Cloud Access and Data Transfer (CADT) team. The CADT team enables secure access and transfer between enclaves and supports other transfer and access types, enabling a wide set of capabilities within highly regulated industries. We welcome you to meet the team and learn about the complex challenges you can solve with us!
We are looking for engineers to join a fast-paced team and solve complex problems in the domain of mission-critical distributed systems spanning data transmission across clouds. Our team works across all facets of isolated system engineering and is deeply involved in the following areas: service automation and reliability improvements, systemic latency reduction, data validation and transformation, and throughput optimization. We need you to help us overcome these challenges.
In this role, you will have the opportunity to automate, build, deploy, and support systems that enable a broad set of Azure services to be consumed by customers in highly secured and regulated industries. The systems you support will be required to meet the security policy and assurance requirements of both public and private sector customers.
Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
Develops process or technology solutions that proactively resolve issues with processes, physical network devices, and/or tooling, and makes optimal use of infrastructure and resources through simple designs and by leveraging automation; prioritizes the development of solutions to deliver high-quality, measurable improvements against Key Performance Indicators (KPIs) across teams.
Triages, troubleshoots, and repairs complex live site issues by applying expertise in physical network components and features (e.g., device operating systems), problem management tools (e.g., root cause analysis, trend analysis, postmortems, repair items), and/or low-level Application Programming Interfaces (APIs) and register sets, to diagnose and address problems using automated, long-term, and sustainable solutions with minimal or no disruption to customers.
Participates in on-call/DRI duties to resolve incidents in production and provides guidance to other engineers on triage, troubleshooting, and resolution processes.
Effectively manages multiple workstreams and resources during incidents, applies diagnostic expertise, provides guidance to other engineers working to mitigate and resolve issues, and maintains a commitment to the quality of products and services throughout the lifecycle; ensures proper notes from incidents are documented and drives the execution of quality postmortem and root cause analysis processes across teams.
Performs analysis of historical incident data to identify trends, patterns, and issues that should be addressed at high priority.
Demonstrates knowledge of data: knows what data is needed, how to find new or missing data, and how to describe the impact of defects on customers or the impact of operations-focused scenarios on networks or infrastructure, as well as the relevance to product and service targets; identifies patterns and trends in data and interprets them to inform decisions related to improving and optimizing products and/or services.
Network Design and Implementation
Leads design, network/code, and security reviews across teams to identify risks and prevent classes of bugs prior to production release by applying expertise in network implementation, available technologies, analysis of telemetry pipelines, and root cause analysis, as well as best practices in identifying and implementing solutions.
Articulates the customer impact of design trade-offs and exceptions and identifies capabilities and limitations of existing tools and resources to ensure they can support design implementation and verification.
Works in collaboration with teams across a single organization to develop reliable, scalable, and high-performance network designs; independently produces design documents and implementation plans.
Supporting People and Execution
Mentors and provides feedback to other engineers, while also proactively seeking mentorship and feedback from others; shares ideas and insights for improving team-oriented behaviors, including DevOps and live site handling skills.