Lead the design, development, and operational excellence of our AI/ML infrastructure. Drive architectural initiatives to ensure exceptional scalability, performance, and reliability. Establish technical standards and mentor senior engineers.
Job Description
About The Company
Join DigitalOcean, a leading cloud infrastructure provider dedicated to simplifying cloud computing for developers and businesses worldwide. Our mission is to empower creators by providing straightforward, scalable, and reliable cloud solutions that foster innovation and growth. As a disruptor in the industry, we pride ourselves on fostering a culture of continuous learning, collaboration, and excellence. Our community of top talent is relentless in their drive to build the simplest and most effective cloud platform, making a profound impact on the world of technology. With a focus on innovation, customer success, and a supportive work environment, DigitalOcean is committed to helping our employees thrive and make a meaningful difference.
About The Role
We are seeking a highly skilled and visionary Lead Architect for our Gradient AI platform. This role involves driving the design, development, and operational excellence of our AI/ML infrastructure, with a focus on creating an intuitive agent development experience. You will lead architectural initiatives to ensure our platform delivers exceptional scalability, performance, and reliability. As a key member of our engineering leadership team, you will establish technical standards, mentor senior engineers, and collaborate cross-functionally with product managers, customer-facing teams, and business leaders. Your expertise will help shape the future of AI agent development within DigitalOcean, enabling us to deliver innovative solutions that meet the evolving needs of our customers and the industry at large.
Qualifications
- Hands-on experience designing and operating production-grade AI/ML platforms utilizing the latest GenAI and agent-development technologies.
- 10+ years of experience designing and building cloud applications, including at least 5 years specifically in AI/ML platform development.
- Proven leadership experience as a technical visionary in large-scale, mission-critical projects.
- Deep expertise in operational excellence, automation, and best practices for scalable and reliable systems.
- Strong communication skills with a demonstrated ability to mentor engineers and translate complex concepts across technical and business teams.
- Experience in establishing and enforcing technical standards, coding practices, and infrastructure guidelines.
- Ability to work effectively in a remote environment and collaborate with diverse teams.
Key Responsibilities
- Design and evolve the architecture for our agent development experience, including code integration, evaluations, observability, and cross-agent interactions.
- Lead initiatives to optimize the architecture for scalability, reliability, low latency, and cost efficiency.
- Manage and enhance our benchmarking systems to continuously improve platform performance and user experience.
- Take a hands-on role in rolling out new services, ensuring timely delivery and high quality standards.
- Establish and enforce technical standards, best practices, and tooling across AI/ML engineering teams.
- Mentor senior engineers, fostering a culture of architectural rigor, operational excellence, and innovation.
- Collaborate with product managers and stakeholders to translate strategic objectives into technical roadmaps.
- Guide customer-facing teams on agent-based AI modernization initiatives, ensuring alignment with platform capabilities.
- Lead operational excellence efforts, including availability, performance tuning, capacity planning, and disaster recovery.
- Drive AI-driven automation across deployment pipelines, monitoring, and infrastructure management.
- Develop internal tooling leveraging agents to improve engineering efficiency and quality.
- Serve as a subject matter expert on new agent development paradigms and lead their implementation to productize innovative solutions.
Benefits & Perks
- Competitive salary within the range of $272,448 - $340,560, commensurate with experience and skills.
- Remote work flexibility, allowing you to work from anywhere.
- Comprehensive health and wellness benefits, including Employee Assistance Programs.
- Opportunities for professional growth and career development within a forward-thinking organization.
- Participation in local meet-ups, training sessions, and community events.
- Flexible time-off policies to support work-life balance.
- Performance-based bonuses and incentives.
DigitalOcean is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate based on race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender identity or expression, age, disability, medical condition, pregnancy, genetic information, marital status, or military service. We believe that a diverse and inclusive workforce enhances our innovation and overall success.