Senior Software Engineer, Core Storage and Integration

Remote / Engineering – Data Infrastructure / Full-time
Labelbox’s mission is to build the best products to align with artificial intelligence. Real breakthroughs in AI depend on the quality of the training data. Labelbox's data engine enables organizations to dramatically improve the quality of their training data, which makes their machine learning models more accurate and performant. We are determined to build software that is more open, easier to use, and singularly focused on helping our customers get to production AI faster.

Current Labelbox customers are transforming industries within insurance, retail, manufacturing/robotics, healthcare, and beyond. Our platform is used by Fortune 500 enterprises including Allstate, Black + Decker, Bayer, Warner Brothers and leading AI-focused companies including FLIR Systems and Caption Health. We are backed by leading investors including SoftBank, Andreessen Horowitz, B Capital, Gradient Ventures (Google's AI-focused fund), Databricks Ventures, Snowpoint Ventures and Kleiner Perkins.

About the Data Infrastructure Team

We are building backend infrastructure for curating large data sets, so that users can search, visualize, explore, and analyze all labeled and unlabeled data, metadata, and model inferences. Our Catalog product empowers analysts and machine learning teams to ingest, process, curate, and understand data at scale in order to make the best decisions.

About The Role

As our Senior Engineer, you will own a highly available storage platform that is operationally simple, scales to billions of objects, and has predictable read and write performance. You will also own integration with cloud storage and customer data lakes, creating seamless solutions for our customers.

Responsibilities and Requirements

    • You have owned, designed, built, delivered and maintained complex systems and integrations.
    • You have excellent communication skills and can explain complex technical concepts and plans, verbally and in writing, to engineers and executives.
    • You value correctness and hate rework, so you write code with high test coverage.
    • You have created secure, scalable storage layers that store billions of records and offer consistent query performance and reasonable transactional guarantees at hundreds of thousands of queries per second.
    • You take the lead on critical escalations, drive them to resolution, and see them as opportunities to improve. You always choose automation over toil.
    • You thrive by empowering other engineering and product teams to identify their data access needs, and by delivering APIs that let them access that data efficiently.
    • 5+ years of relevant industry experience in a modern language (TypeScript, Python, Golang, Java, etc.) delivering consistent low-latency services for users around the world.
    • Expertise in storage scaling techniques, both vertical and horizontal.
    • Experience with Google Cloud, AWS, or other public cloud platforms, plus hands-on experience with Redis and Kubernetes.
    • Experience doing integrations with cloud buckets (S3, GCS, etc.).

    • Experience doing integrations with Databricks is a plus.
    • Experience with ETL mechanisms is a plus.
    • Experience with Spanner and Kotlin is a plus.
    • Experience with messaging systems (e.g., Kafka and Google Pub/Sub) is a plus.
Labelbox strives to ensure pay parity across the organization and to discuss compensation transparently. The expected annual base salary range for this United States-based position is $170,000 - $215,000. This range is not inclusive of any potential equity packages or additional benefits. Exact compensation varies based on a variety of factors, including skills and competencies, experience, and geographical location.

Do great work. From anywhere.

We hire great people regardless of where they live. Work wherever you’d like; reliable internet access is our only requirement. We communicate asynchronously, work autonomously, and take ownership of our work.
