R
Software Engineer (Data & Evals)
Job Description
Onsite in San Francisco, RediRecruit invites a Software Engineer focused on Data and Evals to build and scale data infrastructure for large-scale AI and ML systems. This role centers on speech-to-speech data pipelines and productionizing research within a collaborative, in-person team environment.
Benefits
- Competitive base salary of USD 180,000 to 250,000 per year
- Significant equity as an early founding team member
- Immigration support
- Fully covered medical, dental, and vision insurance
- 401(k) retirement plan
- In-person teamwork in San Francisco
- Opportunity to work with 100+ TBs of data and large-scale AI models
- Small team with high ownership and fast decision-making
Responsibilities
- Build and scale infrastructure and distributed data pipelines for large-scale AI/ML systems
- Process and manage tens to hundreds of terabytes of multimodal data
- Support data systems used for training, evaluation, and improvement of speech-to-speech AI models
- Work on batch processing, real-time streaming, and distributed orchestration systems
- Build reliable pipelines for speech data transformation, filtering, evaluation, and model improvement
- Collaborate closely with a small, high-performing engineering and research team
- Help move cutting-edge AI research into real-world production environments
Requirements
- Experience building infrastructure and distributed data pipelines to process tens to hundreds of terabytes of data
- Experience working with multimodal data in AI/ML products or systems
- Strong experience with batch processing, real-time streaming systems, and distributed orchestration
- Hands-on experience with tools or technologies such as Spark, Kafka, Flyte, Kubernetes, or similar systems
- Strong software engineering fundamentals and ability to build reliable, scalable systems
- Demonstrated ability to learn quickly and adapt in a fast-paced startup environment
- Strong ownership mindset and ability to work independently with high autonomy
- Must be comfortable working in person with the team in San Francisco
Technologies
- Spark
- Kafka
- Flyte
- Kubernetes
Interview process
- 30 minute introductory conversation
- Two technical interviews
- Two culture interviews
- Onsite collaboration session with the team to work on a data system
Nice to have
- Early-stage startup experience
- Independent project, startup, side hustle, or open-source work
- Experience creating transformation pipelines for speech processing
- Experience with transcription, diarization, speech enhancement, filtering, or audio data processing
- Experience working with large-scale AI models or machine learning infrastructure
- Interest in voice AI, speech systems, conversational AI, or multimodal AI products