This job has expired, please see additional jobs below

Site Reliability Engineer

Entertainment & Media Industry Company

San Francisco, CA, United States

Job Details - this job has expired, please see similar jobs below

Summary

Our Site Reliability Engineers are the primary interface between our developers and our production operations. No matter how many times we get searched, scraped, scanned, spammed, pinged, paged or queried, keep calm - and keep the site running smoothly. We work in both the dev and systems worlds, instrumenting key parts of core architecture while supporting developers as they try to do the same.

We're looking for a renaissance coder - someone who lives, breathes and dreams automation and troubleshooting. Of course, you'll work with a variety of programming languages, but get ready to dive into distributed systems, strapped with tools such as strace and tcpdump, to go solve problems! You'll implement monitoring and alerting systems to support site stability and performance. You'll proactively scale our infrastructure to meet ever-increasing demand. You'll make sure that when something goes bump in the night, someone hears it. And you'll play a key role in keeping Company fast, stable and growing.

What You'll Do:
• Work closely with developers in supporting new features and services.
• Monitor site stability and performance.
• Scale infrastructure to meet demand.
• Troubleshoot site issues.
• Develop custom tools as necessary.
• Document system design and procedures.
• Participate in light on-call rotation.

We are Looking For:
• Mastery of Linux or Unix.
• Command of your favorite modern programming language: Python, Ruby, Java, C++, etc.
• Proficiency with configuration management tools like Puppet, Chef, Ansible, etc.
• Solid understanding of fundamental networking technologies.
• Knowledge of best practices related to security, performance, and disaster recovery.
• Experience with web server configuration, monitoring, trending, network design, and high availability.
• Excellent communication skills.
• A sense of humor!

Pluses:
• Past experience with MySQL, PostgreSQL, or other replicated databases (high availability, scale-out replication).
• Advanced knowledge of network design, management of Juniper network equipment, or BGP.
• Experience at a large-scale consumer internet site.
• Ubuntu distribution familiarity.
• Deep understanding of the Python runtime and ecosystem.

Interested in applying? Sweet! Share with us why you want to work at Company, and don't forget to mention any side projects, open source contributions, and personal websites/blogs.

Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
