Onsite in San Francisco, RediRecruit invites a Software Engineer focused on Data and Evals to build and scale data infrastructure for large-scale AI and ML systems. This role centers on speech-to-speech data pipelines and productionizing research within a collaborative, in-person team environment.

Benefits

Competitive base salary of USD 180,000 to 250,000 per year
Significant equity as an early founding team member
Immigration support
Fully covered medical, dental, and vision insurance
401(k) retirement plan
In-person teamwork in San Francisco
Opportunity to work with 100+ TBs of data and large-scale AI models
Small team with high ownership and fast decision-making

Responsibilities

Build and scale infrastructure and distributed data pipelines for large-scale AI/ML systems
Process and manage tens to hundreds of terabytes of multimodal data
Support data systems used for training, evaluation, and improvement of speech-to-speech AI models
Work on batch processing, real-time streaming, and distributed orchestration systems
Build reliable pipelines for speech data transformation, filtering, evaluation, and model improvement
Collaborate closely with a small, high-performing engineering and research team
Help move cutting-edge AI research into real-world production environments

Requirements

Experience building infrastructure and distributed data pipelines to process tens to hundreds of terabytes of data
Experience working with multimodal data in AI/ML products or systems
Strong experience with batch processing, real-time streaming systems, and distributed orchestration
Hands-on experience with tools or technologies such as Spark, Kafka, Flyte, Kubernetes, or similar systems
Strong software engineering fundamentals and ability to build reliable, scalable systems
Demonstrated ability to learn quickly and adapt in a fast-paced startup environment
Strong ownership mindset and ability to work independently with high autonomy
Must be comfortable working in person with the team in San Francisco

Technologies

Spark
Kafka
Flyte
Kubernetes

Interview process

30 minute introductory conversation
Two technical interviews
Two culture interviews
Onsite collaboration session with the team to work on a data system

Nice to have

Early-stage startup experience
Independent project, startup, side hustle, or open-source work
Experience creating transformation pipelines for speech processing
Experience with transcription, diarization, speech enhancement, filtering, or audio data processing
Experience working with large-scale AI models or machine learning infrastructure
Interest in voice AI, speech systems, conversational AI, or multimodal AI products

Software Engineer (Data & Evals)

Job Description

Benefits

Responsibilities

Requirements

Technologies

Interview process

Nice to have

Similar Jobs

Principal AI / Machine Learning Data Engineer

Data Engineer

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer, Engineering Data Analytics

Senior Data Engineer (Chinese Mandarin Speaker)