Software Development Engineer (Java) for Distributed Data and Analytics Ecosystem

Akvelon, Inc. • Serbia
Remote
Apply
AI Summary

Join our team as a Software Development Engineer (Java) to contribute to an open-source table format project, improving interoperability with a major cloud-based data lake service. Enhance stream processing engine support, improve compatibility, and address authentication inconsistencies. Collaborate with distributed teams and contribute code, design discussions, and reviews within the open-source community.

Key Highlights
Contribute to open-source development related to distributed data table format specifications and ecosystem tooling
Improve stream processing engine support to align with the latest table format specification version
Enhance compatibility between cloud-based data lake services and open-source table format tooling
Key Responsibilities
Contribute to open-source development related to distributed data table format specifications and ecosystem tooling
Improve stream processing engine support to align with the latest table format specification version
Enhance compatibility between cloud-based data lake services and open-source table format tooling
Identify and address authentication and authorization gaps in OSS integrations, especially related to cloud identity mechanisms
Implement community-approved specification features that are not yet available in current implementations
Analyze and close behavioral gaps between formal specifications and real-world tooling implementations
Technical Skills Required
Java Distributed data systems Large-scale data processing frameworks GCP Apache Iceberg Apache Flink Cloud environments Authentication mechanisms in distributed systems
Benefits & Perks
Work conditions: Overlap until 10 AM PST
B2B contact
Remote/Hybrid work format
Locations: Serbia, Poland, Croatia, Portugal, Kazakhstan, Armenia, Georgia, Egypt OR Locations with CET hours/Remote
Nice to Have
Experience with cloud-based data lake services and lakehouse architectures
Familiarity with catalog federation concepts and metadata management systems
Understanding of specification-driven development and standards alignment
Previous experience working in managed service or vendor-based delivery models
Experience optimizing performance and scalability of distributed data processing systems

Job Description


We are looking for 3 Software Development Engineers (Java) to join a managed service engagement supporting a large-scale data and analytics ecosystem. The team will contribute to an open-source table format project widely used in distributed data processing systems, with the goal of improving interoperability with a major cloud-based data lake service. The initial scope includes enhancing stream processing engine support for the latest table format specification, improving compatibility between cloud-native lakehouse capabilities and open-source tooling, addressing authentication inconsistencies in OSS integrations, implementing already-approved specification features, and closing functional gaps between specification and tooling behavior.


Tasks

  • Contribute to open-source development related to distributed data table format specifications and ecosystem tooling;

  • Improve stream processing engine support to align with the latest table format specification version;

  • Enhance compatibility between cloud-based data lake services and open-source table format tooling;

  • Identify and address authentication and authorization gaps in OSS integrations, especially related to cloud identity mechanisms;

  • Implement community-approved specification features that are not yet available in current implementations;

  • Analyze and close behavioral gaps between formal specifications and real-world tooling implementations;

  • Collaborate with distributed teams and contribute code, design discussions, and reviews within the open-source community;

  • Ensure production-grade quality, maintainability, and performance of contributed components.


Requirements

  • 3+ years of experience as a Software Development Engineer with strong Java expertise;

  • Solid experience working with distributed data systems and large-scale data processing frameworks;

  • Experinence with GCP and its services;

  • Hands-on experience with Apache Iceberg AND/OR Apache Flink;

  • Strong understanding of cloud environments and authentication mechanisms in distributed systems;

  • Strong problem-solving skills and ability to work independently in a distributed team setup;

  • Good written and verbal communication skills in English.


Nice to have:



  • Experience with cloud-based data lake services and lakehouse architectures;

  • Familiarity with catalog federation concepts and metadata management systems;

  • Understanding of specification-driven development and standards alignment;

  • Previous experience working in managed service or vendor-based delivery models;

  • Experience optimizing performance and scalability of distributed data processing systems.


Benefits

Work conditions:



  • Overlap until 10 AM PST;

  • B2B contact

  • Remote/Hybrid work format

  • Company benefits


Locations: Serbia, Poland, Croatia, Portugal, Kazakhstan, Armenia, Georgia, Egypt OR Locations with CET hours/Remote


Similar Jobs

Explore other opportunities that match your interests

Customer Experience Specialist

Programming
•
19h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

Smartcat

Serbia

Product Engineer (AI-First Systems)

Programming
•
4d ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

BnBerry

Serbia

Event Manager

Programming
•
5d ago
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Not Applicable

Infomediji

Serbia

Subscribe our newsletter

New Things Will Always Update Regularly