Role Overview
We are looking for a highly skilled Senior AI Engineer with hands-on experience in building and deploying AI/ML solutions using Large Language Models (LLMs). The ideal candidate will have strong expertise in Python, distributed data processing (PySpark), and modern AI frameworks, with a proven ability to design scalable and production-grade AI systems.
Required Skills
- 5+ years of experience in AI/ML Engineering or Data Science
- Strong programming expertise in Python
- Hands-on experience with LLMs (GPT, LLaMA, Mistral, etc.)
- Experience with PySpark for large-scale data processing
- Solid understanding of NLP concepts and transformer architectures
- Experience with:
- LangChain / LlamaIndex
- Vector databases (FAISS, Pinecone, Weaviate)
- REST APIs / FastAPI
- Knowledge of model deployment (Docker, Kubernetes, APIs)
Key Responsibilities
- Design, develop, and deploy AI/ML models with a focus on LLMs (GPT, LLaMA, etc.)
- Build and optimize LLM-based applications such as chatbots, summarization, and semantic search systems
- Work with large-scale datasets using PySpark for data processing and feature engineering
- Implement RAG (Retrieval-Augmented Generation) pipelines using vector databases
- Fine-tune and evaluate LLMs using frameworks like Hugging Face / OpenAI APIs
- Develop scalable backend services using Python (FastAPI/Flask)
- Collaborate with data engineers and product teams to deliver end-to-end AI solutions
- Ensure model performance, monitoring, and continuous improvement in production
- Stay updated with the latest advancements in Generative AI and LLM ecosystems