DataOps Engineer

Latin America
Engineering / Full-time / Remote
Who we are:
Factored was conceived in Palo Alto, California by Andrew Ng and a team of highly experienced AI researchers, educators, and engineers to help address the significant shortage of qualified AI & Machine-Learning engineers globally. We know that exceptional technical aptitude, intelligence, communication skills, and passion are equally distributed worldwide, and we are very committed to testing, vetting, and nurturing the most talented engineers for our program and on behalf of our clients.

We are looking for a DataOps Engineer with experience in machine learning environments to join our team. You will drive the development of AI products for external clients and internal efforts, and participate in the development of top-notch AI systems.

At Factored we are building a company that we all hold as our own, every single one of us. We need your skills to help take this rocketship to new heights and help create new opportunities for us. In return, you will be rewarded with an amazing team that supports you, a rich culture, shared success, and the flexibility to work from the comfort of your home.

What you will be doing:

    • Design, build, and maintain data pipelines that collect, transform, and store data from various sources. This may include batch processing, real-time streaming, and ETL (Extract, Transform, Load) tasks.
    • Integrate data from different systems and sources, ensuring seamless data access and compatibility between various data stores and formats.
    • Utilize your development expertise in data processing frameworks such as Apache Spark, Apache Flink, or similar technologies to build robust data pipelines.
    • Set up and manage data storage systems, databases, data warehouses, and cloud infrastructure to support data processing and analytics.
    • Monitor data pipelines and infrastructure for performance issues, bottlenecks, and errors. Optimize processes for improved efficiency and scalability.
    • Utilize GitHub for version control, code collaboration, and project management.
    • Lead or support the development of proofs of concept and prototypes to evaluate the feasibility and effectiveness of new data technologies.
    • Employ Apache Airflow or Dagster to efficiently create and manage data workflows and pipelines.
    • Work with both SQL and NoSQL databases, utilizing your expertise to optimize data storage and retrieval.
    • Utilize Kubernetes to manage and orchestrate containerized data applications with a development-oriented perspective.

What you must bring:

    • 4+ years of DataOps or Data Infrastructure experience
    • 3+ years of experience working with data operations projects or related fields.
    • Demonstrated experience with tools such as Python and Databricks.
    • Experience with cloud platforms (e.g., AWS, Azure, GCP) and containerization (e.g., Docker, Kubernetes).
    • Experience working with both SQL and NoSQL databases.
    • Strong understanding of data modeling, data warehousing, and data governance principles.
    • Previous involvement in designing and executing proofs of concept or similar projects.
    • Comfortable using GitHub for code management and collaboration.
    • Strong written and oral communication in English.

At Factored, we believe that passionate, smart people expect honesty and transparency, as well as the freedom to do the best work of their lives while learning and growing as much as possible. Great people enjoy working with other passionate, smart people, so we believe in hiring right, and are very selective about who joins our team. Once we hire you, we will invest in you and support your career and professional growth in many meaningful ways. We hire people who are supremely intelligent and talented, but we recognize that intelligence is not enough. Perhaps more importantly, we look for those who are also passionate about our mission and are honest, diligent, collaborative, kind to others, and fun to be around. Life is too short to work with people who don’t inspire you.

We are a transparent workplace, where EVERYBODY has a voice in building OUR company, and where learning and growth are available to everyone based on their merits, not just on stamps on their resume. As impressive as some of the stamps on our resumes are, we recognize that human talent and passion exist everywhere, and come from many backgrounds, so stamps matter much less than results. All of us are dedicated doers and are highly energetic, focusing vehemently on execution because we know that the best learning happens by doing. We recognize that we are creating OUR COMPANY TOGETHER, which is not only a high-performing, fast-growing business but is also changing the way the world perceives the quality of technical talent in Latin America. We are fueled by the great positive impact we are making in the places where we do business, and are committed to accelerating careers and investing in hundreds (and hopefully thousands) of highly talented data science engineers and data analysts.

In short, our business is about people, so we hire the best people and invest as much as possible in making them fall in love with their work, their learning, and their mission. When not nerding out on data science, we love to make music together, play sports, play games, dance salsa, cook delicious food, brew the best coffee, throw the best parties, and generally have a great time with each other.