When you're the best, we're the best. We instill an environment where employees feel engaged, satisfied and able to contribute their unique skills and talents while living and working as their authentic selves. We provide extensive opportunities for personal and professional development, building both employee competence and organizational capability to fuel exceptional performance through an inclusive environment both now and in the future.
Summary
In this role, the Director, AI Operations and Optimization will lead the operationalization, reliability, optimization, and continuous improvement of enterprise AI capabilities across Vizient. This leader is responsible for establishing scalable AI runtime operational practices, advancing AIOps and LLMOps capabilities, implementing observability and monitoring frameworks, and driving operational excellence for production AI solutions. The Director will oversee the operational support and continuous improvement of AI-powered applications, agentic workflows, and reusable AI platform capabilities while ensuring reliability, governance, security, and performance at enterprise scale. Through cross-functional collaboration and strong operational leadership, this role will help enable Vizient's enterprise AI transformation strategy by delivering sustainable, scalable, and responsible AI operations.
Responsibilities:
AI Runtime Operations & Reliability
- Lead enterprise AI operational activities, including runtime monitoring, operational support, incident management, production reliability, and operational continuity for AI-powered applications and intelligent automation solutions.
- Establish, implement, and continuously improve AI operational practices, including AIOps and LLMOps processes, runtime observability, operational telemetry, drift detection, release coordination, support workflows, and operational readiness activities.
- Drive runtime stability and service reliability initiatives through production monitoring, escalation management, root cause analysis, operational playbooks, and service continuity practices.
- Support enforcement of runtime governance standards, operational safeguards, human oversight controls, and secure operationalization practices for enterprise AI solutions.
- Ensure operational excellence across AI environments through proactive monitoring, issue prevention, and continuous service improvement efforts.
AI Optimization & Operational Maturity
- Lead initiatives focused on runtime efficiency, operational scalability, inference utilization, supportability, performance optimization, and sustainable AI operations.
- Support the implementation and optimization of reusable operational patterns, observability frameworks, support standards, telemetry pipelines, operational tooling, and AI support capabilities.
- Promote standardized operational processes, scalable support models, automation opportunities, and continuous improvement initiatives across AI operations functions.
- Drive operational maturity by identifying opportunities to enhance performance, reduce operational risk, and improve support effectiveness.
Cross-Functional Operational Coordination
- Partner closely with AI Engineering & Delivery, AI Governance, AI Quality Engineering, Automation, Architecture, Platform Engineering, Security, Infrastructure, and business stakeholders to ensure operational readiness and runtime reliability.
- Coordinate operational execution activities across AI operations teams, including operational planning, vendor and contractor management, issue prioritization, escalation management, knowledge transfer, and delivery continuity.
- Support operational assessments, production readiness reviews, implementation planning, runtime support strategies, and modernization initiatives for prioritized AI capabilities.
- Collaborate with technical and business leaders to align operational practices with enterprise AI objectives and service expectations.
Leadership, Communication & Team Development
- Lead, mentor, and develop operations managers, engineers, analysts, and contractor resources while fostering a high-performing, collaborative, and continuously learning culture.
- Provide clear communication regarding operational performance, runtime risks, service reliability concerns, optimization opportunities, engineering tradeoffs, and strategic recommendations.
- Establish accountability for operational outcomes while promoting operational discipline, innovation, and continuous improvement.
- Research and evaluate emerging AI operational technologies, observability platforms, automation capabilities, optimization techniques, and runtime management practices to drive innovation and operational effectiveness.
Qualifications
- Bachelor's degree in Computer Science, Information Systems, Engineering, Technology Management, or a related field preferred.
- 8+ years of experience in AI operations, software engineering, platform operations, engineering delivery, DevOps, Site Reliability Engineering (SRE), infrastructure operations, or related enterprise technology functions required.
- 3+ years of experience leading operational teams, engineering support organizations, platform operations, or large-scale technology initiatives required.
- Hands-on experience supporting, operationalizing, monitoring, or optimizing production AI solutions utilizing large language models (LLMs), APIs, agentic workflows, orchestration frameworks, and modern AI engineering practices required.
- Strong experience implementing and scaling operational support models, observability practices, incident management processes, DevOps methodologies, runtime operations, or enterprise operational frameworks required.
- Experience with observability platforms, monitoring tools, incident management processes, runtime operations, CI/CD pipelines, and production support practices required.
- Experience leading distributed teams, managing contractors and vendors, and delivering operational initiatives within complex and evolving environments required.
- Experience with cloud platforms, APIs, data integration technologies, automation frameworks, monitoring solutions, DevOps tools, and modern operational toolsets required.
- Strong analytical, problem-solving, communication, presentation, stakeholder management, and cross-functional collaboration skills required.
- Demonstrated ability to manage multiple priorities in fast-paced, evolving, and operationally dynamic environments required.
- Experience supporting enterprise-scale AI, automation, digital transformation, or platform modernization initiatives preferred.
- Knowledge of AI governance, responsible AI principles, operational risk management, and production AI lifecycle management preferred.
#LI-JB1
Estimated Hiring Range:
At Vizient, we consider skills, experience, and organizational needs in our compensation approach. Geographic factors may adjust the range estimate, and hires typically fall below the top of the range. Compensation decisions are tailored to individual circumstances. The current salary range for this role is $117,600.00 to $206,000.00.
This position is also incentive eligible. Vizient has a comprehensive benefits plan! Please view our benefits here: http://www.vizientinc.com/about-us/careers
Equal Opportunity Employer: Females/Minorities/Veterans/Individuals with Disabilities
The Company is committed to equal employment opportunity for all employees and applicants without regard to race, religion, color, gender identity, ethnicity, age, national origin, sexual orientation, disability status, veteran status or any other category protected by applicable law.