Applied Research Engineer - Video Understanding

pulserise technologies • United State
Visa Sponsorship
Apply
AI Summary

We are seeking an Applied Research Engineer to build high-performance pipelines and infrastructure to understand video with precision at internet scale. The role requires 5+ years of experience in computer vision or audio processing, strong Python skills, and hands-on experience with PyTorch. The ideal candidate will have a strong ownership mindset, clear communication skills, and experience building large-scale multimodal systems.

Key Highlights
Build scalable pipelines for video understanding
Work with large models and APIs, optimizing inference performance
Implement parallelization, pipelining, and inference optimization strategies
Key Responsibilities
Build scalable pipelines for video understanding
Work with large models and APIs, optimizing inference performance
Implement parallelization, pipelining, and inference optimization strategies
Occasionally fine-tune models where needed
Break down customer-level requirements into technical building blocks
Write clean, production-ready Python code
Collaborate with customers and external research teams
Contribute to the evolution of next-generation video datasets
Technical Skills Required
Python PyTorch Computer Vision Audio Processing
Benefits & Perks
H-1B, O-1, OPT sponsorship
On-site employment
Nice to Have
Published research
Open-source contributions

Job Description


Dear applicants, please keep in mind that applications without provided salary expectations and active LN profile will not be considered.

Hope for your understanding.

Location: San Francisco, CA

Employment Type: Full-Time ONSITE

Visa Sponsorship: H-1B, O-1, OPT supported

We are an AI research lab focused exclusively on video data. Video represents the dominant digital medium globally — powering creativity, communication, gaming, AR/VR, robotics, and beyond. The biggest bottleneck in advancing these systems is high-quality training data at scale.

Our team combines:

  • Exabyte-scale video infrastructure
  • Novel video understanding techniques
  • Large-scale multimodal datasets


We partner with leading AI labs and recently completed a Series A round backed by Tier 1 investors. The team is lean (≈12 people), high-signal, and operating at the frontier of multimodal AI.

As an Applied Research Engineer, you will build high-performance pipelines and infrastructure to understand video with precision at internet scale.

This role sits between research and production:

  • Not purely academic research
  • Not pure infrastructure engineering
  • You will work on ambiguous, open-ended problems in:
  • Computer Vision
  • Audio Processing
  • Multimodal (video + text + audio) systems


You’ll design clever techniques to extract signal from large-scale data while optimizing performance and cost.

What You’ll Do

  • Build scalable pipelines for video understanding
  • Work with large models and APIs, optimizing inference performance
  • Apply pre- and post-processing techniques to improve model precision
  • Implement parallelization, pipelining, and inference optimization strategies
  • Occasionally fine-tune models where needed
  • Break down customer-level requirements into technical building blocks
  • Write clean, production-ready Python code
  • Collaborate with customers and external research teams
  • Contribute to the evolution of next-generation video datasets


Requirements

  • 5+ years experience in computer vision or audio processing
  • Strong Python skills
  • Hands-on experience with PyTorch (or similar ML frameworks)
  • Experience working with large models or model APIs
  • Ability to optimize inference pipelines
  • Clear communication skills (technical + external stakeholders)
  • Strong ownership mindset
  • In-person presence in San Francisco
  • Experience building large-scale multimodal systems
  • Startup experience (early hire)
  • Open-source contributions
  • Published research (bonus, not required)
  • Demonstrated performance optimization work
  • Passion for video / media technologies


Interview Process

  • Initial Screen
  • Technical Discussion with CTO
  • Deep Technical Interview
  • Conversation with CEO
  • On-site
  • Offer

Similar Jobs

Explore other opportunities that match your interests

Forward Deployed Engineer

Programming
•
3h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

pulserise technologies

United State

Senior Risk Analyst, Payment Fraud

Programming
•
4h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

snaplii

United State

Engineering Manager - AI Agents

Programming
•
6h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

Verkada

United State

Subscribe our newsletter

New Things Will Always Update Regularly