Sr. DevOps Engineer

Santa Monica, CA
Hallmark Labs
POSITION: Sr. DevOps Engineer
LOCATION: Santa Monica, CA
CONTACT: Kristen Starick, Recruiting Associate, e., m. 323.521.4942
Hallmark Labs (the digital subsidiary of Hallmark), is the parent company of two digital subscription services Hallmark Movies Now and as well as ongoing initiatives in personalized, print-on-demand greeting cards.
We’re pushing Hallmark’s century-old brand to the digital forefront with cutting-edge technology and innovative products. In true Hallmark fashion, Hallmark Labs creates a more emotionally connected world by making a genuine difference in every life, every day.
·       Enjoy the autonomy, innovation, and benefits of a startup culture, with the deep-pocket funding and established customer base of a 109-year-old company
·       Work on Hallmark’s most innovative, digital product lines
·       Become part of a growing, dedicated engineering team that deeply values internal growth and professional development
This position spans systems management and IaaS automation in building, monitoring, maintaining, and alerting Linux systems in AWS, to working with QA to ensure their test automation is running correctly on every commit to GitHub. This job is all about automation, IaaS, and uptime. Responsibilities include change management, access control, addressing any issues that aren't already automated, and working with engineering teams to ensure the solutions they're deploying are supportive and scalable for our growing customer base.We love innovation, and support efforts that provide automated systems for the purpose of 99.99% uptime.


    • Work closely with infrastructure, engineering, and customer service teams to ensure services are available 24/7
    • Drive technical innovation and efficiency in infrastructure operations via automation
    • Design systems management solutions using automation and self-repair rather than relying on alarming and human intervention
    • Ensure all systems have required security compliance for patch management, anti-virus, and other threat protection 
    • Create processes that enhance operational workflow and provide positive customer impact
    • Dive deep to resolve problems at their root, looking for failure patterns amenable to long-term solutions via simplification and automation
    • Avoid re-inventing the wheel and prefer appropriately simple, repeatable solutions over more complex and failure-prone ones
    • Act as a technical point of escalation 
    • Develop appropriate metrics to demonstrate performance at improving operational efficiency
    • Recognize and adopt best practices in documentation, testing, security, operational support at scale, and efficient use of resources
    • Support off-hours on-call
    • Problem solve and troubleshoot, including performing root-cause analysis for preventative analysis
    • Work on small, cross-functional, fast-paced teams
    • Utilize organizational skills and the ability to manage a diversified workload
    • Communicate effectively with all levels of staff, including senior management
    • Work under minimal supervision on complex issues to deliver results on schedule


    • EDUCATION: BA/BS in Computer Science or related field
    • Substantial enterprise infrastructure experience
    • AWS experience is a must
    • IaaS design and micro-service systems architecture experience, or related experience
    • Experience with capacity planning, utilization review, and monitoring of availability and performance
    • Held a prior role with responsibility for High Scalability/Availability Systems Architecture, Security, and/or Systems Support
    • Experience with configuration and management of multiple server platforms


    • Strong with automation languages such as Ruby, Python, or Go
    • Experience with configuration management tools such as Ansible, Puppet, or Chef
    • Implemented continuous integration tools such as Jenkins, Rundeck, Ant, or Maven
    • Used ELK, Grafana, Zabbix, Cloudwatch, Cloudformation, or other open source/cloud ready tools
    • Implemented, managed, and refined disaster recovery solutions
    • Proficient in TCP/IP networking, architecture, and other core network technologies (DNS, HTTP, Routing, Firewalls, Load Balancers, etc.)
    • Familiarity with both SQL and NoSQL technologies such as MySQL, MongoDB, Redis, etc.
    • Knowledge of Agile processes and DevOps manifesto


    • Diverse team of collaborative, inspired professionals
    • Highly competitive salary
    • Excellent medical benefits
    • 401k with 5% match
    • Life insurance at no cost
    • Flexible PTO
    • Flexible work hours
    • Generous maternity/paternity leave
    • Employee assistance programs
    • Cell phone and home internet reimbursement
    • Fitness reimbursement
    • On-site free parking; carpool and parking pass cash-out program
    • Passionate about equal opportunities in the workplace
    • We have the normal perks for a company in Santa Monica: Fully stocked kitchen, catered lunches, writable walls, collaboration spaces, casual dress, bicycles, company events, professional development, etc.