Founding Senior Data Engineer - Public Real Estate Auction Intelligence Pipeline

VendueTech · Ireland
Remote

Job Description



Founding Senior Data Engineer


Part-time · Remote · Flexible · ESOP / Stock Options Only

VendueTech Ltd.



About VendueTech


At VendueTech, we are building the data and AI infrastructure powering the future of public real estate auctions.


We transform fragmented legal, financial, and government data into structured, real-time intelligence that helps customers make faster, smarter decisions. Our goal is to build the data backbone for public real estate auction intelligence across markets, combining large-scale data engineering, AI-driven extraction, and product delivery into one scalable platform.


This is an exciting time to join. VendueTech has already secured a Eurostars R&D grant, and we are currently progressing through the EIC Accelerator long proposal process. For the right person, this creates meaningful upside: the opportunity to contribute at an early stage, shape core infrastructure, and participate in long-term value creation through the VendueTech Ltd. ESOP stock options program.



About the role


We are looking for a Founding Senior Data Engineer to help design and build the core data platform behind VendueTech’s AI-driven auction intelligence pipeline.


This is a highly strategic and foundational role. You will shape how data flows through the platform and directly influence the capabilities of current and future VendueTech products.


This is a part-time, fully remote, and highly flexible opportunity, designed for experienced engineers who want to contribute alongside their current work, consulting, startup, or academic commitments. The role is currently structured as equity-only, through participation in the VendueTech Ltd. ESOP program.



What you will work on


Our platform aggregates public real estate auction data across multiple countries and transforms it into actionable intelligence. You will work at the intersection of data engineering, AI, analytics, and product development to build a robust and scalable data backbone for cross-border auction intelligence.



Responsibilities


  • Design and build scalable ETL / ELT pipelines for ingesting structured and unstructured data from public, legal, and government sources
  • Build AI-native data pipelines where LLMs extract and normalize structured auction data from noisy, incomplete, and multilingual inputs
  • Design and implement human-in-the-loop workflows that route low-confidence outputs to reviewers, capture corrections, and create continuous feedback loops for model and data quality improvement
  • Help create a self-improving data flywheel in which reviewer feedback, operational learnings, and user signals continuously improve extraction accuracy and data quality
  • Architect and evolve the company’s data lake and data warehouse to support analytics, machine learning, internal tooling, and customer-facing APIs
  • Define canonical data models for auctions, properties, legal events, and transactions across multiple jurisdictions
  • Build and maintain ML-ready datasets and data layers for downstream use cases such as predictive pricing, risk scoring, and recommendation systems
  • Collaborate closely with AI / NLP engineers, backend engineers, product teams, and domain experts to interpret source data and turn it into scalable systems
  • Contribute to API design and customer-facing data delivery layers
  • Improve data observability, validation, lineage, reliability, monitoring, and performance across the platform
  • Help establish best practices for data architecture, testing, deployment, governance, and documentation in a startup environment



Required qualifications


  • 5+ years of experience in data engineering, data platform engineering, or similar roles
  • Proven experience designing and building scalable, production-grade data pipelines and data models
  • Strong SQL skills and solid Python experience for data engineering workflows
  • Hands-on experience with data processing frameworks such as Apache Spark, PySpark, Pandas, or similar tools
  • Experience working with PostgreSQL, data lakes, and data warehouses
  • Experience building real-time or near-real-time data pipelines using Apache Kafka or similar streaming technologies
  • Strong understanding of distributed systems, pipeline orchestration, reliability, and performance tuning
  • Familiarity with machine learning workflows and the data requirements of AI / ML systems in production
  • Strong problem-solving skills and the ability to work independently in a remote environment
  • Good communication skills and the ability to collaborate across technical and non-technical teams



Preferred qualifications


  • Experience with web scraping or public-data ingestion pipelines
  • Experience with NLP pipelines, LLM-based extraction, or AI-assisted document and data processing
  • Familiarity with real estate, legal-tech, fintech, or public-sector datasets
  • Familiarity with FastAPI, microservices, Terraform, and infrastructure-as-code practices
  • Startup experience, especially building 0→1 systems
  • Experience designing QA, review, or feedback-loop systems for AI / LLM outputs
  • Familiarity with BI, dashboarding, and observability tools such as Apache Superset, Grafana, and Kibana



What this opportunity offers


  • A part-time role with flexibility around your existing commitments
  • A fully remote setup
  • A flexible working model focused on outcomes and contribution rather than fixed hours
  • Participation in the VendueTech Ltd. ESOP stock options program
  • The chance to help build a high-impact data and AI platform from an early stage
  • Direct collaboration with a multidisciplinary team across AI, engineering, product, and business
  • Meaningful ownership in shaping the core infrastructure of the company
  • The opportunity to join at a stage where the company has already achieved external validation (a secured Eurostars R&D grant and an EIC Accelerator long proposal in progress), with substantial room for future upside



Important note on compensation


This role is currently structured as an equity-only opportunity through stock options under the VendueTech Ltd. ESOP program. It is best suited for someone who believes in the long-term vision, wants meaningful upside, and is excited to help build foundational technology in an early-stage startup environment.



Why join now


VendueTech is building infrastructure for a large and fragmented market where high-quality data is difficult to access, normalize, and operationalize. By joining now, you will have the opportunity to influence core architectural decisions, work on technically ambitious AI and data challenges, and contribute to a company building long-term value in the real estate intelligence space.
