This job has expired, please see additional jobs below
Monitoring Specialist
Entertainment & Media Industry Company
Montreal, , Canada
Job Details - this job has expired, please see similar jobs below
Summary
The Monitoring Specialist is responsible for supporting IT operations teams with effective monitoring and event management solutions. Processes and solutions must meet the demanding nature of online gaming and distributed applications in a complex, cloud-based and virtualized environment. The monitoring specialist will be part of a transversal team charged with driving improvements to infrastructure, application and service monitoring within IT.
Mission
The main responsibilities and routine tasks of the Monitoring Specialist are to:
• Implement efficient event management processes and automation;
• Configure and maintain central monitoring platforms;
• Perform daily health checks and break/fix support on monitoring platforms;
• Manage new technology integrations into monitoring systems;
• Participate in the governance of application and infrastructure monitoring design, implementation, customization and support;
• Design console solutions to consolidate views of service events for support staff;
• Provide event logging and historical repositories to aid in the investigation and prevention of incidents, problems and service quality issues;
• Provide filtering and event correlation mechanisms to reduce event noise;
• Grow the technical skillset of everyone, including his/her peers though peer mentoring, coaching, training, etc.
• Recommend the establishment or modification of current policies and standards where applicable;
• Keep on top of current and emerging technologies;
• Carry out all other related tasks.
Qualifications
Training
Bachelor’s degree or equivalent experience in Computer Information Systems, Computer Science, Mathematics or a related field.
Relevant experience
• 5+ years of experience in IT with at least 3 years in a large scale enterprise environment
• 3+ years of experience in IT operations, NOC experience a plus
• 3+ years of experience managing monitoring solutions
• Application and end-user experience monitoring preferred
Skills
• Oral and written comprehension of English
• “Can-do” attitude and a high degree of motivation
• Ability to quickly understand new tools
• Ability to make complex information accessible
• Ability to influence others in a matrixed environment
• Ability to drive timely completion of tasks
• Easily adaptable to changes in a fast-paced environment
• Must be a self-starter that requires only limited supervision/guidance
• Be a team player
• Good interpersonal communication skills
Knowledge
• Comfortable with scripting or programming languages
• Good knowledge on infrastructure protocols to gather element-level event data
• Good knowledge of open source monitoring technologies like time-series DBs, metrics dashboards, real-time graphing, graph editors, ELK stack and Vector framework
• Proficient with data lifecycles and aggregation, reporting and web dashboards
• Good understanding of network performance monitoring, application performance management, high-resolution systems monitoring and IT operations analytics
• Proficient in ITIL event management and good basis in ITIL foundational concepts
• Good understanding of event correlation and analysis techniques and solutions
• Exposure to several of the following products: Zabbix, Solarwinds, SCOM, Sumo Logic, Splunk, AppDynamics, New Relic, DataDog, Sensu, and Consul
• Broad understanding of cloud environments, distributed application architectures, and web-scale technologies