Data Engineer (Data Pipelines & RAG)
About This Gig
Our client is a fast growing Property Tech AI company About the role They are seeking a versatile Data & AI Engineer to build, deploy & maintain end-to-end data pipelines for downstream Gen AI applications. You'll design data models and transformations, build scalable ETL/ELT workflows, while learning fast and working on the AI agent space. Key Responsibilities Data Modeling & Pipeline development Automate data ingestion from diverse sources (Databases, APIs, files, Sharepoint/ document management tools, URLs). Most files are expected to be unstructured documents with different file formats, tables, charts, process flows, schedules, construction layouts/drawings, etc. Own chunking strategy, embedding, indexing all unstructured & structured data for efficient retrieval by downstream RAG/agent systems Build, test, and maintain robust ETL/ELT workflows using Spark (batch & streaming) Define and implement logical/physical data models and schemas. Develop schema mapping
Skills & Tags
About the Seller
Hyred
on Himalayas