DeveloperJobs.io
← Back to all jobs

Job Description

LogicRays Technologies Pvt. Ltd offers an onsite AI Developer role in Iselin, NJ, with an hourly rate of USD 10 to 20. This position centers on solutioning, architecture design, and prototype integrations between partner products and our platform, with a focus on agentic pipelines, RAG architectures, and scalable production integration. The role invites you to collaborate across partner engineering teams and internal platform groups to turn field insights into robust, production-ready solutions.

Responsibilities

  • Design and implement solution architectures and prototypes that integrate partner products with the company platform
  • Define reference architectures for partner integrations
  • Scope partner architectures against our platform
  • Analyze how our stack supports the product, identifying integration points and potential gaps
  • Build production-quality proofs of concept across the AI stack including agentic pipelines, RAG architectures, inference optimization patterns, and multi-model orchestration
  • Produce working proofs of concept that serve as the starting point for product development
  • Maintain a library of reference architectures and integration patterns for internal teams to leverage
  • Collaborate directly with partner engineering teams to scope, prototype, and progress integrations
  • Assess partner architectures objectively
  • Report integration pain points and gains; provide technical guidance to partners on performance, reliability, and cost efficiency on company infrastructure
  • Create technical scoping that gives teams a clear view of integration feasibility, depth, and complexity
  • Translate external integration findings into actionable product requirements for company platform teams
  • Engage with ISV partners, SI teams, and field teams to scale solution adoption and drive revenue when a solution is ready
  • Highlight recurring architectural patterns and gaps to inform platform roadmap decisions
  • Participate in platform planning as the technical voice reflecting field experiences
  • Represent the company at hackathons, open source communities, and technical events; contribute to building in public
  • Deliver demos, reference architectures, and integrations that position the company as a preferred platform for AI builders
  • Stay current with the AI tooling ecosystem
  • Agentic focus areas: agent frameworks, memory systems, tool integration, orchestration, MCP, guardrails
  • Managed Inference focus areas: inference runtimes, model serving, optimization tooling, speculative decoding, KV-cache routing
  • IaaS / Managed Infrastructure focus areas: cloud-native integrations, GPU orchestration, enterprise platform connectors
  • Data focus areas: vector databases, retrieval systems, RAG architectures, data pipeline integrations, synthetic data tooling

Requirements

  • 6+ years of hands-on engineering experience in AI application development, ML systems, or AI infrastructure
  • Deep working knowledge of the AI developer stack
  • LLM APIs, inference runtimes, orchestration frameworks, vector databases, RAG architectures, agentic pipelines
  • Hands-on experience with agentic frameworks such as LangChain, LangGraph, CrewAI, AutoGen, or equivalents
  • Strong Python programming skills and comfort prototyping end-to-end AI systems quickly
  • Experience defining reference architectures and technical patterns
  • Proven ability to move from idea to working prototype fast and shipping meaningful work under time pressure
  • Experience building integrations across APIs and developer platforms
  • Comfortable working across external partner engineering teams and internal company product and engineering teams simultaneously
  • Strong technical communication
  • Ability to explain architecture decisions and integration findings to a founding CTO and a non-technical partner lead in the same day
  • Experience with inference frameworks and optimization: vLLM, SGLang, TensorRT-LLM, speculative decoding, quantization, batching, KV-cache routing
  • Familiarity with NVIDIA's software stack: CUDA, TensorRT, NeMo, or equivalent
  • Experience with multimodal AI models
  • Won or placed at major AI hackathons in the past 12 months
  • Experience as a developer advocate, solutions engineer, or technical partner manager at a leading AI platform or developer tooling company
  • Been an early engineer at a YC-backed AI startup
  • Open source projects or public demos with meaningful community adoption
  • Work Authorization: Open to Green Card holders and U.S. Citizens

Technologies

  • Docker, Kubernetes, Git, Python
  • vLLM, SGLang, TensorRT-LLM, Transformers
  • OpenAI SDKs, Anthropic SDKs, LangChain, LangGraph, CrewAI, AutoGen
  • smolagents, Qdrant, Weaviate, Milvus, pgvector
  • FastAPI, Flask
  • AWS, GCP, Azure
  • CUDA, TensorRT, NeMo

Work Authorization

Open to Green Card holders and U.S. Citizens

Location and Compensation

Location: Iselin, New Jersey — onsite. Compensation: USD 10.00 to 20.00 per hour.

Similar Jobs

Get Job Alerts

New jobs delivered to your inbox.