Description
We are seeking a highly skilled and motivated AI/ML Engineer with a strong foundation in programming and data engineering. The ideal candidate will be responsible for designing, developing, and deploying AI/ML models while ensuring seamless data integration and pipeline optimization. The role requires a blend of hands-on machine learning expertise and software engineering proficiency to drive intelligent insights and business solutions.
Job Responsibilities
- Building complex multi-agent AI agents that are highly scalable
- Work with LLMs, embedding models, and Retrieval-Augmented Generation (RAG) systems.
- Engineer and refine prompts to enhance AI performance and output quality.
- Deploy and scale AI solutions using AWS (Lambda, cloud services) and modern architectures.
- Ensure AI applications align with ethical standards, data privacy, and real-world scalability.
- Develop, fine-tune, and optimize generative AI models using TensorFlow, PyTorch, or Hugging Face.
Desired Skills, Expertise and Knowledge
- Work with current state of the art LLMs and embedding models.
- Experience building agentic AI systems.
- Experience with debugging traces of LLM calls to identify errors/optimizations.
- Experience with building Retrieval-Augmented Generation (RAG) systems.
- Engineer and refine prompts to enhance AI performance and output quality.
- Knowledge of extracting structured outputs from LLMs.
- Experience using LLM APIs, embedding models, and RAG-based AI
architectures. - Strong skills in Python, AI model deployment, and AWS services (Lambda
preferred). - Knowledge of LangChain, Pydantic, and scalable AI workflows.
- Proficiency in prompt engineering and optimization techniques.
- Some UI/UX experience is a plus.
Preferred:
- Experience in NLP, computer vision, or multimodal AI.
- Proven track record of deploying AI solutions at scale.
- Research background in generative AI models.