Senior Distributed System Engineer

New York
Engineering /
Distributed System Engineer /
Hybrid
About NEBRA
NEBRA is a research & development organization dedicated to advancing the use of zero-knowledge proofs (ZKP) to enable a more private and scalable blockchain future.

NEBRA released NEBRA UPA, the first universal proof aggregation protocol that scales ZKP verification more than 10x on Ethereum and L2s. This marked the humble beginning of our journey and there are more exciting products in the pipeline.

NEBRA seeks a Distributed System Engineer to join our world-class team. You'll work with our research and cryptographic engineering experts to develop critical components for our decentralized cloud platform. This role focuses on building innovative solutions combining distributed computing automation, Kubernetes operators, and private networking code.

Key Responsibilities:

    • Manage GPU systems across cloud providers, from single-node to large clusters.
    • Implement GPU provisioning and lifecycle management using cloud-native solutions.
    • Develop and maintain scalable Go programs in Kubernetes environments.
    • Create next-generation multi-cloud infrastructure management systems.
    • Support users through bug fixes, documentation, and feature improvements.

Requirements:

    • BS or higher in Computer Science or equivalent experience.
    • 5+ years of industry experience with strong Kubernetes and SRE background.
    • Expertise in cloud-native ecosystem and modern distributed systems.
    • Proficiency in software development lifecycle
    • Business-level English communication skills.
    • Experience with GPU programming (CUDA) and Kubernetes internals.
    • Development of Kubernetes operators.
    • Implementation of network-as-code architectures and custom Kubernetes clusters.
    • GitOps stack implementation (preferably ArgoCD).

Preferred Experience:

    • VPN technologies: Tailscale, Headscale, Wireguard, Kilo.
    • CNI plugins: Calico, Cilium, Flannel.
    • Service mesh: Traefik, Istio.
    • Cloud-native storage operators (e.g., Rook/Ceph).
    • gRPC.
    • Additional languages: GPG, Python, Bash/Make.