This job has expired, please see additional jobs below
Director of Engineering - Operational Telemetry
Walmart
Sunnyvale, CA, United States
Job Details - this job has expired, please see similar jobs below
Position Description
Walmart's System Telemetry team needs a Director to scale up our eCommerce-focused, multitenant system telemetry services to the diverse needs of the whole company, including applications for associates, suppliers, and in-store customers. System telemetry technologies of all sorts are evolving rapidly, as are the systems they support; the challenge for this person is to make these bleeding-edge capabilities available to the Walmart enterprise, effortlessly and reliably, enabling extreme agility in combination with operational excellence. Equally important, this person will develop the capabilities of the team and its products, continuously improving ease-of-use, data quality, data accessibility, and overall transparency.
#LI-WW1
Minimum Qualifications
5+ years of experience building AND operating large scale, high availability software applications
Direct, extensive experience instrumenting software, monitoring and managing alerts, diagnosing problems by analyzing logs, and profiling to improve performance
Firm understandings of operating system, network, and infrastructure technologies; including essential metrics, failure modes, and potential resource constraints
Facility analyzing huge datasets using both structured (data warehouse) and unstructured (big data) tools and techniques
Ability to engage and keep up with a high-performing team in a fast-paced environment comprised of competing critical objectives
Excellent communication and collaboration skills
Additional Preferred Qualifications
Experience governing dynamic datasets among large sets of both producers and consumers
Experience building and monitoring cloud- and container-based applications
Experience with the CNCF ecosystem of technologies
Experience supporting monitoring tools and systems (Prometheus, InfluxDB, Sensu, ...)
Experience using or supporting Application Performance Management tools (New Relic, App Dynamics, DataDog, DynaTrace, ...)
Experience using or supporting log analytics tools (Splunk, ...)
Experience using or supporting application tracing technology (Jaeger, ...)
Experience building stream processing applications (Storm, Spark, Flink, ...)
Company Summary
Walmart's Global Cloud Platform team helps thousands of developers save time and code better, so that millions of Walmart associates can help hundreds of millions of customers save money and live better. We sustain millions of transactions per second, process petabytes of data, and enable tens of thousands of production deployments per day. We simplify the complexities of scale and unify software development for all aspects of the business, digital and physical.
At Walmart, we understand that it's our people who make the difference. The talent and dedication of our teammates -- as they share an innovative idea, forge a well-crafted piece of code, or take the time to listen and make sure we're tackling the right problem -- accelerates the business and inspires us to do our own best work. With 69% GMV year-over-year eCommerce growth, websites in 11 countries, plus 11,000+ stores worldwide, Walmart generates $500 billion in annual revenue, and continues to revolutionize retail at a scale no one else can match.