Staff Software Engineer

Foster City, California
Engineering /
Full-time /
Hybrid
About Alluxio
Alluxio powers the data layer for modern AI and analytics. Proven in production at eight of the top ten internet companies and seven of the ten highest-valued enterprises globally, Alluxio’s data orchestration platform unifies data across storage systems, regions, and clouds providing a high-performance distributed caching layer built for large-scale AI workloads.

Spun out of UC Berkeley’s AMPLab by the creators of Tachyon and backed by Andreessen Horowitz, Hillhouse Capital, and Seven Seas Partners, Alluxio sits at the intersection of data, distributed systems, and AI infrastructure.

Our technology is deployed at scale by organizations such as Meta, Uber, Tencent, TikTok, Alibaba, Expedia, Rakuten, Microsoft, and Walmart, orchestrating data for billions of operations per day. Learn more at alluxio.io or on Wikipedia.

The Role
We’re looking for experienced distributed-systems engineers to join our Core Product team and advance the next generation of Alluxio’s data-orchestration engine - the foundation for AI and analytics at global scale.

As a Staff Software Engineer, you’ll work on high-impact systems problems such as:
1. Optimizing metadata management, caching, and replication across thousands of nodes.
2. Designing concurrent, fault-tolerant services for multi-region and multi-cloud environments.
3. Evolving Alluxio’s storage abstraction and scheduling layer to support large-scale AI/ML data pipelines.
4. Collaborating with internal product teams to push the limits of distributed I/O performance.

This is a hands-on, architecture-plus-implementation role for engineers who love deep systems work and want visible impact in a small, senior, highly technical team.

What You’ll Own

    • Cache and metadata consistency - advance Alluxio’s intelligent caching framework for multi-tenant environments (TTL policies, write-back consistency, invalidation protocols, and distributed metadata scaling).
    • High-throughput data I/O optimization - profile and optimize Alluxio’s data path across S3, GCS, HDFS, and POSIX interfaces using adaptive prefetching, async I/O, and tier-aware scheduling.
    • Scaling for AI and analytics workloads - evolve the coordination layer to efficiently serve distributed AI training clusters, accelerating model load and shuffle operations across regions and clouds.
    • Observability and performance insights - build fine-grained metrics and tracing for cache efficiency, throughput, and latency across storage tiers.
    • Open-source leadership - drive design discussions, mentor contributors, and represent Alluxio’s core-systems direction within the OSS community.

What You’ll Do

    • Design and implement core components of Alluxio’s distributed file and object-access layer.
    • Optimize performance for large-scale, high-throughput environments using advanced concurrency and caching techniques.
    • Build scalable metadata and coordination systems that ensure strong consistency, high availability, and minimal latency.
    • Collaborate cross-functionally with product, solution-engineering, and research teams to drive roadmap and customer success.

What We’re Looking For

    • Strong computer-science fundamentals and a passion for large-scale distributed systems.
    • Professional experience developing in Java, C++, or Go.
    • Deep understanding of concurrency, replication, fault tolerance, and performance optimization.
    • Experience with distributed storage, data-access layers, or cloud infrastructure (e.g., Spark, Presto, Hadoop, Kubernetes).
    • Bachelor’s or advanced degree in Computer Science or related technical field (or equivalent experience).
    • Demonstrated technical leadership: defining architecture, mentoring peers, or driving major projects from design through release.

Why Alluxio

    • Build infrastructure trusted by the world’s largest AI and data-driven companies.
    • Join a small, senior engineering team where your designs shape the product’s evolution.
    • Work directly with the original creators of open-source Alluxio.
    • A culture of empathy, curiosity, and ownership - where engineers collaborate closely to solve hard problems.
Alluxio is an equal opportunity employer and does not discriminate in employment on the basis of race, color, religion, sex (including pregnancy and gender identity), national origin, political affiliation, sexual orientation, marital status, disability, genetic information, age, membership in an employee organization, retaliation, parental status, military service, or other non-merit factors.

The base salary range for this US full-time position is $200,000 - $235,000, depending on experience, and subject to standard withholding and applicable taxes. All candidates receive equity (stock options) and access to a comprehensive benefits offering.  The base salary range reflects the minimum and maximum target for candidates across all US locations. Work location, skills, experience, and any relevant education or training determine the compensation awarded to the candidate. The Recruiting Team or Hiring Manager can share more about the specific salary range with you during the recruitment process.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.