AI Engineer - Hybrid RAG Solution (LLM & RAG)

Job Details

Job Summary:

We are looking for an experienced AI Engineer specializing in Retrieval-Augmented Generation (RAG) to build and optimize hybrid AI solutions leveraging Large Language Models (LLMs). This role involves working with cutting-edge language models and retrieval systems to deliver highly accurate, context-aware, and responsive AI applications. You'll collaborate with cross-functional teams to develop scalable solutions that enhance information retrieval, comprehension, and generation capabilities in real-world applications.

Key Responsibilities:

- Design, develop, and deploy hybrid RAG architectures integrating LLMs with retrieval-based systems for improved relevance and contextual responses.
- Fine-tune and optimize large language models, enhancing their performance and adaptability to domain-specific requirements.
- Implement and manage RAG pipelines that effectively combine retrieval mechanisms with generative capabilities, ensuring high accuracy and efficiency.
- Develop custom plugins, adapters, or APIs to integrate retrieval systems (e.g., Elasticsearch, FAISS) with generative models, facilitating seamless information retrieval.
- Monitor and troubleshoot issues within RAG pipelines, fine-tuning retrieval parameters and model hyperparameters to optimize performance.
- Work closely with data engineers to manage and preprocess large datasets for training, ensuring high-quality and diverse data coverage.
- Evaluate and benchmark the performance of RAG solutions, using metrics such as response accuracy, latency, and user satisfaction.
- Stay up to date with advancements in NLP, LLMs, and RAG methodologies, continually improving existing architectures and recommending new techniques.

Qualifications:

- Bachelor's or Master's degree in Computer Science, Artificial Intelligence, or a related field, or equivalent practical experience.
- 3+ years of experience in AI/NLP, with a focus on LLMs, transformer-based architectures, and retrieval systems.
- Proven experience building and deploying RAG solutions or other hybrid AI architectures.
- Strong understanding of information retrieval methods, including dense retrieval, sparse retrieval, and embeddings-based techniques.
- Proficiency in Python, TensorFlow or PyTorch, and experience with libraries and tools related to LLMs, such as Hugging Face Transformers.
- Familiarity with retrieval frameworks like Elasticsearch, FAISS, or OpenSearch.
- Knowledge of prompt engineering, fine-tuning, and deployment of language models for production environments.
- Strong analytical skills, with experience in optimizing LLM and retrieval model performance.
- English required.

Preferred Skills:

- Experience with cloud services and infrastructure (AWS, GCP, Azure) and MLOps tools for model deployment and monitoring.
- Contributions to open-source RAG projects or experience working with OpenAI, LangChain, or similar frameworks.
- Knowledge of vector databases, memory-augmented networks, and distributed systems.

Nominal Salary: Negotiable

Source: Whatjobs_Ppc
