Backend Engineer - Content and Metadata

San Francisco OR Remote US/Canada
Engineering – Content Engineering
Full-Time
Our team
The Content Engineering team is broadly responsible for catalog management and content metadata at Scribd. Supplying supplementary data to ebook and audiobook pages? That's us. Ensuring that all user-uploaded documents are useful, accessible, and legally available? That's us. Creating pipelines that produce clean, well-structured data that Search, Recommendations, and Data Science use to build amazing features? That's us. Analyzing user reading activity and translating it into publisher payouts? That's us. We're a spoke within Scribd, connecting many engineering, product, and publisher-focused teams through data.

The majority of the team is based in San Francisco, but there's a strong and growing remote contingent as well (much like Scribd overall). We use tools that emphasize asynchronous communication (Slack, GitLab, Jira, Google Docs) and are ready and able to jump on a video call when text doesn't cut it. Regardless of the medium, solid communication skills are a must. We operate with autonomy (developers closest to the code will make the most well-informed decisions) while holding ourselves and each other accountable for using good judgement when faced with each day's unique challenges.

Our technical work is divided between our user-facing Rails application and our offline data warehouse (where much of our processing is done on top of Spark). Many of the systems we're responsible for - document spam detection, document copyright detection, topic extraction and classification, sitemap generation, and translating user activity into publisher payouts, just to name a few - span both environments, so engineers regularly work within both. Though the tech stacks differ between environments, the engineering work in both is the same: building data pipelines that ingest, process, clean, and lay out the metadata coming from publishers and other external sources, and that create new metadata from our vast content base.

The role
A Backend Engineer on Content Engineering can take many forms:

You may be a relatively new college or boot camp graduate, looking for your first job where you can learn the ropes from a team of experienced professionals. You have a place here. 

You may have a few years of experience and be looking for your next challenge. You have a place here.

You may have built out a few systems alongside senior engineers and be ready to take on ownership of feature delivery. You have a place here.

We look for engineers who aspire to learn and grow, who thrive on constructive feedback, and who know they’ll be ready to step up when the opportunity presents itself.

Office or remote?
We have a wonderful new office in San Francisco, as well as smaller offices in Toronto and New York. If you live close to one of those you'll find great people and a nice work environment.

If you don't live near one of those offices, we'd still love to have you! Scribd is expanding its remote workforce with the goal of finding the best employees regardless of location. Being a remote employee means providing your own productive work environment, and everything else is pretty similar to being an office employee. We expect remote employees to have solid communication skills, good judgement, and demonstrable personal responsibility. We also expect the same from our in-office employees, so you'll be in good company.

Nitpicky requirements
Backend Engineers on Content Engineering typically have:
0-6+ years of experience as a professional software engineer
Experience or a strong interest in backend systems and data pipelines
Bachelor’s in CS or equivalent professional experience

We present these to fill in the picture of what we're looking for. Of course, every engineer brings something unique to the table, and we like nothing more than finding a diamond in the rough.