Lead Data Engineer Resume Samples

The Guide To Resume Tailoring

Guide the recruiter to the conclusion that you are the best candidate for the lead data engineer job. It’s actually very simple. Tailor your resume by picking relevant responsibilities from the examples below and then add your accomplishments. This way, you can position yourself in the best way to get hired.

Craft your perfect resume by picking job responsibilities written by professional recruiters

Pick from the thousands of curated job responsibilities used by the leading companies

Tailor your resume & cover letter with wording that best fits each job you apply for

Resume Builder

Create a Resume in Minutes with Professional Resume Templates

CHOOSE THE BEST TEMPLATE - Choose from 15 Leading Templates. No need to think about design details.
USE PRE-WRITTEN BULLET POINTS - Select from thousands of pre-written bullet points.
SAVE YOUR DOCUMENTS IN PDF FILES - Instantly download in PDF format or share a custom link.

Solon Homenick
941 Deontae Skyway, Philadelphia, PA
Phone: +1 (555) 166 6011
Experience
Lead Data Engineer
Dietrich, Pfeffer and Nikolaus
Houston, TX
  • Architect, Design, Develop, and Improve databases and ETL processes in scope of Application development
  • Provide direction and guidance to other Big Data developers to solve problems, improve efficiency and process, and employ new technology
  • Manager will be expected to perform hands-on coding and performance optimization
  • Provides scoping, estimating, planning, design, development, and support services to a project
  • Assist in the definition of software architecture to ensure that the online organization’s software solutions are built within a consistent framework
  • Work with business and technology partners to provide reporting capabilities for all our internal customers
  • Provide subject matter expertise in the analysis, preparation of specifications and plans for the development of data processes
Shopstyle Lead Data Engineer
Gleichner, Hayes and Stroman
San Francisco, CA
  • Manage more junior members of the development team
  • Lead the software development lifecycle of new processing jobs and data pipelines
  • Working knowledge of both relational and document-oriented database systems
  • Architect improvements to ShopStyle’s data processing platform
  • BS in Computer Science or demonstrated sustained success in the software engineering field
  • Competent Scala programmer
  • Basic knowledge of Machine Learning techniques and algorithms
Lead Data Engineer Data & Analytics
Bartoletti, Cruickshank and Hermann
Detroit, MI
present
  • Work across multiple cross-functional teams in high visibility roles and own the data solution end-to-end
  • Adopt a FastWorks mindset to support Business data and analytics projects
  • Apply your expertise in quantitative business analysis, data mining, and the presentation of data to see beyond the numbers and drive enterprise level outcomes
  • Help drive a best-in-class data engineering practice that will be leveraged across GE
  • Participates in setting strategy and standards through data architecture and implementation leveraging big data and analytics tools and technologies
  • Develop high performance, distributed computing algorithms using Big Data technologies such as Hadoop, text mining, and other distributed environment technologies
  • Assist in creating documents that ensure consistency in development across the online organization. Implements and improves core software infrastructure
Education
Bachelor’s Degree in Computer Science
Belmont University
Skills
  • The ability to develop reliable, maintainable, efficient code in most of SQL, Linux shell, Java and Python
  • The ability to prioritize effectively in order to be productive in a highly dynamic environment
  • Work across IT teams to ensure code quality, performance, and scalability of deployed data products
  • Very good knowledge of standard SQL (SQL-92) gained using large scale database systems such as Oracle, Netezza, DB2, Sybase
  • Ability to communicate to a wide audience of professionals internally and externally
  • Excellent knowledge of data structures, algorithms and design patterns
  • Knowledge of the global payments industry or a good understanding of the payments transaction life cycle is an advantage
  • A good understanding of UNIX systems, especially Linux
  • Excellent organizational, communication and time management skills
  • Strong ETL / Data Warehouse skills

15 Lead Data Engineer resume templates

1

Lead Data Engineer Resume Examples & Samples

  • 10+ years experience developing data systems or systems of similar complexity
  • 2+ years with either Hadoop/Cloudera ecosystem (HDFS, Impala, Hive) OR Cassandra
  • 2+ years experience with Java OR Scala
  • 2+ years with Kafka OR Flume and Storm
  • 2+ years experience with standard developer tooling (Puppet, Jenkins, Ant, etc.) on Linux Environments
  • 2+ years experience with the following data technologies: Splunk, MySQL (master/slave replication), C/C++, or SQL
2

Lead Data Engineer Resume Examples & Samples

  • Experience working with MVC based front-end libraries such as AngularJS
  • Experience in building web applications in a Linux environment and handling or analyzing large volumes of data
  • Experience working with tag management systems
  • Experience working with cloud based technologies like AWS
  • Experience in product optimization or A/B testing is a plus
  • B.S. or M.S. in Computer Science or related fields preferred
  • Lead a team of engineers tasked with building the next generation internal dashboard tool
  • Extend our real time data streaming capabilities for timely insights
  • Extend analytics via automation and tooling to the entire digital organization to enable cohesive data driven decisions
  • Design and develop event-tracking mechanisms based on web/app analytics projects
3

Lead Data Engineer Resume Examples & Samples

  • MicroStrategy development including creation of metrics, attributes, schema objects, reports and dashboards
  • Analyzing, designing, coding, testing, installing and maintaining complex applications
  • MicroStrategy Intelligent Cubes, multi-source, dashboards, dynamic sourcing, incremental refreshing and mobile
  • MicroStrategy Narrowcast and report services
  • Developing schema objects, base objects and reporting objects in a power user deployment model
4

Lead Data Engineer Resume Examples & Samples

  • 5+ years in data warehousing, schema design and data modeling
  • 2+ years experience building production data pipelines (using Hadoop, Hive, Pig, Spark, etc.) on web-scale datasets
  • 2+ years experience in custom or structured ETL design, implementation and maintenance
  • 1+ years experience with AWS
  • Programming proficiency in at least one major language
  • General knowledge of predictive modeling and algorithms
5

Lead Data Engineer Resume Examples & Samples

  • Lead the administration of on premise and cloud based Hadoop clusters
  • Make information available to large scale, next generation, predictive analytics applications
  • Lead the effort to build, implement and support the data infrastructure; ingest and transform data (ETL/ELT process)
  • Bachelor’s degree in computer science, software engineering, or related field
  • Three years of experience architecting, building and administering large-scale distributed applications using Hadoop and/or Linux systems
6

Lead Data Engineer Data & Analytics Resume Examples & Samples

  • Help drive a best-in-class data engineering practice that will be leveraged across GE
  • Participates in setting strategy and standards through data architecture and implementation leveraging big data and analytics tools and technologies
  • Bachelor's Degree in Computer Science, Information Technology or equivalent (STEM) with minimum 4 years' experience as data engineer or data architect. Advanced degrees preferred
  • A minimum of 3 years of efficient SQL (Oracle, Vertica, Hive, etc) experience is required
  • A minimum of 1 year of efficient Scripting (Pig, Python, Perl, etc) experience is required
  • A minimum of 1 year of experience using the Hadoop ecosystem
7

Lead Data Engineer Resume Examples & Samples

  • Create state-of-the-art data and analytics driven solutions, developing and deploying cutting edge scalable algorithms, working across GE to drive business analytics to a new level of predictive analytics while leveraging big data tools and technologies
  • Architect, design and develop new systems and tools to enable users to consume and understand data faster
  • Adopt a FastWorks mindset to support Business data and analytics projects
  • Engineer structured and unstructured data from source systems to fit business needs
  • Bachelor's degree in Computer Science, Science, Information Technology, Engineering (any) or equivalent
  • The position requires experience using the Hadoop ecosystem, MapReduce/Spark, or experience with a NoSQL technology (HBase or MongoDB or Cassandra)
  • Experience using Scripting (Python/Ruby/Perl)
  • Experience working on SQL databases
8

Lead Data Engineer Resume Examples & Samples

  • Architect, Design and develop new systems and tools to enable users to consume and understand data faster
  • Adopt a FastWorks mindset to support Business data and analytics projects
  • Participates in setting strategy and standards through data architecture and implementation
  • Engineers structured and unstructured data from source systems to fit business needs
  • Leads analysis, modeling and design of the application data structure, storage, integration, deployment and support
  • Fluent in both normalized and dimensional model disciplines and techniques and in creating logical and physical data models
  • Proficient in using data modeling tools, such as Power Designer or Erwin
  • A minimum of 5 years of professional experience in software development or as a data engineer, OR a Master’s degree with 3 years of experience in software development or as a data engineer
  • A minimum of 1 year of experience using Scripting (Python, Perl, Shell, etc.) is required
  • A minimum of 1 year of experience working on database(s) like PostgreSQL, Oracle, SQL Server. SQL is required
  • Advanced degrees preferred
  • Ability to leverage data assets to respond to complex questions that require timely answers
  • Skilled in breaking down problems, documenting problem statements and estimating efforts
  • Ability to take ownership of small and medium sized tasks and deliver while mentoring and helping team members
  • Continuously measures deliverables of self and team against scheduled commitments. Effectively balances different, competing objectives
  • Strong interpersonal skills
  • Effective team building and problem solving abilities
9

Lead Data Engineer Data & Analytics Resume Examples & Samples

  • Apply your expertise in quantitative business analysis, data mining, and the presentation of data to see beyond the numbers and drive enterprise level outcomes
  • Develop high performance, distributed computing algorithms using Big Data technologies such as Hadoop, text mining, and other distributed environment technologies
  • Execute and evaluate appropriate analyses (cluster analysis, logistic/linear regression, collaborative filtering, etc.) given an array of tactical and strategic objectives
  • Bachelor's in Computer Science, Math, Physics, Applied Economics, Statistics or other technical field. Advanced degrees preferred
  • 2+ years’ experience doing quantitative analysis
10

Lead Data Engineer Resume Examples & Samples

  • Work across multiple cross-functional teams and own the data solution end-to-end
  • Architect, Design, Develop, and Improve databases and ETL processes in scope of Application development
  • Bachelor's Degree in Computer Science, Information Technology or equivalent (STEM) with minimum 5 years of experience as data engineer
  • Minimum 3 years of experience in designing and developing database solutions
  • Minimum 3 years of experience in designing and developing ETL solutions
  • Experience working with AWS DB and ETL tech stack
  • Experience working with PostgreSQL DB and Oracle DB
11

Lead Data Engineer Resume Examples & Samples

  • The Big Data Engineer will provide technological expertise in the implementation, production and maintenance of our data science initiatives, e.g. a large-scale recommender system. The role will be responsible for building the data pipeline, implementing production-ready data-driven systems/solutions, and supporting the deployment and usage of these solutions
  • Implementation – build the data pipeline and turn the data science prototype into a production-ready solution
  • Deployment – manage the source code of the solution and deploy the solution from the DEV environment to the UAT environment and then to the production environment with a proper testing plan
  • Maintenance – implement automation solutions to monitor the performance of predictive models and refresh models when needed
  • Administration – manage the Hadoop/Spark cluster to prioritize and manage multiple distributed computing jobs and generate timely insights that drive business impact
12

Lead Data Engineer Resume Examples & Samples

  • Support, maintain, and improve existing logic that moves data into and through the Hadoop cluster (NiFi, Python scripts, custom Java parsers, Hadoop MapReduce jobs, Pig scripts, Spark scripts, Oozie workflows, etc.)
  • Improve and enhance the value provided by Gogo’s Hadoop cluster, in conjunction with the Hadoop Architect, through the application of Apache Spark and other emerging technologies
  • Operate as the senior-most developer, performing development work themselves as the need arises
  • Participate heavily in efforts to migrate Gogo’s Big Data cluster from a self-hosted environment to AWS
  • Function as the Lead Hadoop Developer on a team of 4-8 individuals: Driving work through the team, reviewing work, coding as required, and ensuring the overall success of the team
  • Own the majority of deliverables for the Big Data team from a delivery perspective. Receive and adhere to project delivery deadlines
  • Provide direction and guidance to other Big Data developers to solve problems, improve efficiency and process, and employ new technology
  • Maintain, support, and enhance all elements of Gogo’s Hadoop Cluster and Big Data environment
  • Regularly interact with other development teams to understand the general technologies that they leverage for source code management, code development and deployment processes/ methodologies to ensure that the Redwood team maintains, or applies, new technology
  • Create jobs to perform auditing and error handling within the Hadoop cluster
  • Create automated jobs to push data from the Hadoop cluster to an Enterprise Data Warehouse
  • Troubleshoot and resolve issues that arise in the Hadoop cluster
  • Participate in an on call rotation to support the Hadoop cluster
  • Interface with various user groups to assess requirements and design robust solutions to meet them
  • Explore new technological solutions, learn and trial them, and deploy them in the Hadoop environment to solve business and operational challenges
  • 2+ years of experience operating as a lead developer and/or hands-on manager
  • 2+ years of strong development experience on a Hadoop stack
  • 10+ years of experience in the data/database space
  • 5+ years of data integration experience
  • Very proficient in writing Apache Pig and Apache Spark jobs, including an understanding of optimization techniques
  • Experience with Apache Nifi a plus
  • Very proficient in Python, Scala, or Java
  • Strong experience processing/parsing files using a scripting language
  • Experience importing/exporting data using Sqoop or Flume
  • Experience creating tables/views in Apache Hive
  • Experience authoring queries against a Hadoop file system
  • Direct, hands-on experience writing Hadoop MapReduce jobs and developing an Oozie workflow
  • Experience writing complex SQL queries
  • Expert level knowledge of software development process and practices
  • Experience performing data transformations via scripting, stored procedures, or an ETL tool a plus
  • Technical education is a plus, as is an advanced degree in CS, EE, MIS or CIS
  • Experience working with cellular network data or airline data a plus
13

Lead Data Engineer Resume Examples & Samples

  • Manages and leads the production support of multiple data platforms
  • Work closely with application development managers to ensure the application requirements are satisfied by the Data Platform
  • Manager will be expected to perform hands-on coding and performance optimization
  • Partners with Business Teams to assess how technologies can best streamline processes and/or add business value
  • Minimum of a B.S. in Computer Science, MIS or related degree and seven (7) years of related experience including management or leadership experience or combination of education, training and experience
  • Expert level experience with ODI is required; experience with Informatica is a plus
  • Financial services industry experience is a plus
  • Interacts with others in a way that promotes openness & trust and gives confidence in one’s intentions
14

Lead Data Engineer Resume Examples & Samples

  • Design data integration solutions that deliver business value in line with the company's objectives
  • Provide the thought leadership within IT to advise the business, mentor staff, and leverage industry best practices in data integration
  • Provides scoping, estimating, planning, design, development, and support services to a project
  • Identify and develop the Technical detail design document
  • Document and present solution alternatives to clients, which support business processes and business objectives
  • Collaborate with the IT product management team to influence product direction
  • Track progress and intervene as needed to eliminate barriers and ensure delivery
  • Resolve or escalate problems, and manage risk for both development and production support
  • Coordinate vendors and contractors for specific projects or systems
  • Review the technical work of other team members
  • Ensure adherence to IT standards and methods
  • Maintain deep knowledge and awareness of technical & industry best practices and trends, especially in technology & methodologies
  • Ensure that good business relationships are maintained with multiple clients and other IT departments to ensure successful implementation and support of project efforts
  • Responsible for communicating status, problems, issues and underlying process problems to technical and non-technical audiences
  • Participate in establishing IT standards and processes
  • Participate in recruiting employees and contractors
  • Participate in the evaluation of new software and hardware
  • Provide input to manager with respect to training needs of team members
  • Provide training, guidance, assistance, and knowledge transfer among team members
15

Lead Data Engineer Resume Examples & Samples

  • Experience with Big-Data Cluster-Computation: Hadoop/Hive/Pig and related technologies
  • Experience with SQL and noSQL based technologies
  • Strong engineering background, ideally experienced with large scale data/distributed systems
  • Experience building and consuming REST APIs
  • Experience building modern web applications using one or more of the following (Ruby, Python, Java, Node.js)
  • Familiarity with Statistical analysis and modeling
  • Strong algorithmic and analytical skills
  • Minimum 3 years’ experience with Big data technologies
  • Education – Bachelor's in Computer Science
  • Experience dealing with large amounts of data for internet scale production applications
  • Experience with Shell scripting
  • Experience in creating pivot tables, cubes (such as SSAS), reports (such as SSRS) and dashboards
  • Experience with payments industry
16

Lead Data Engineer Resume Examples & Samples

  • Provide consultation of data and application integration technologies
  • Define integration strategies
  • Design, develop, configure, test and deploy data integration across the analytic solution
  • Ensure data loads from various source systems and files
  • No direct reports, will have to work with multiple suppliers
  • Experience in at least 3-5 full lifecycle IT delivery projects
  • Practical experience with SAP Integration (ALE, RFC, IDoc), ideally with Oracle as well
  • Knowledge of Data Warehouse and BI solutions, Tableau is a plus
  • Ideally familiar with AWS and JavaScript
  • Preferably having a good understanding of Supply Chain and Finance areas
  • Ideally holds a completed degree in IT
  • Professional, can-do and will-do attitude is required
  • Experience and exposure to highly international work environment
  • Ability to work in a disciplined, focused and structured manner with good time-management skills
  • Passion for quality work
17

Shopstyle Lead Data Engineer Resume Examples & Samples

  • Architect improvements to ShopStyle’s data processing platform
  • Lead the software development lifecycle of new processing jobs and data pipelines
  • Work with data scientists and other business members to develop requirements for data engineering projects
  • Manage more junior members of the development team
18

Lead Data Engineer Resume Examples & Samples

  • 7+ years of development experience with Python
  • Strong Unix/Linux skills
  • Experience in petabyte scale data environments
  • Django experience a plus
  • Hadoop experience a plus
  • SQL on Hadoop (Hawq/Impala/Presto) experience a plus
19

Lead Data Engineer Resume Examples & Samples

  • Implementation and testing of statistical models in Python and Java
  • Researching and evaluating applications of mathematics and machine learning to our products
  • Performing data analysis on large data sets
  • Communicating results of work with teammates and company stakeholders
  • 4+ years of relevant experience on a team blending data science and software development
20

Lead Data Engineer Resume Examples & Samples

  • Ability to quickly identify an opportunity and recommend possible technical solutions
  • Utilize multiple development languages/tools such as Python, Spark, HBase, Hive, Microsoft R, Java to build prototypes and evaluate results for effectiveness and feasibility
  • Operationalize open source data-analytic tools for enterprise use
  • Develop real-time data ingestion and stream-analytic solutions leveraging technologies such as Kafka, Apache Spark, NiFi, Python, HBase and Hadoop
  • Custom data pipeline development (cloud and locally hosted)
  • Work heavily within the Hadoop ecosystem and migrate data from Teradata to Hadoop
  • 3+ years’ experience working with Hadoop
21

Lead Data Engineer Resume Examples & Samples

  • 7-10 or more years of progressively complex related experience
  • Has in-depth knowledge of large scale search applications and building high volume data pipelines
  • Advanced knowledge in Hadoop architecture
  • Experience with Big Data Movement (Ingest/Outbound) in a Cloud environment (AWS)
  • Health care data experience is preferred
22

Lead Data Engineer Resume Examples & Samples

  • Work with site, Aerospace resources to develop cross-platform ETL to support advanced analytics for ISC
  • Mine systems data for insights to improve COPQ
  • Design, develop, and maintain cross-platform ETL processes and maintain dimensions and reference lookup dictionaries
  • Develop guidelines, standards, and processes to ensure the highest data quality and integrity in the data stores residing on the Big Data platform
  • Work closely with data scientists and data owners in the supply chain to understand their data requirements for existing and future projects on data analytics applications
  • Work with IT and data owners to understand the types of data collected in various databases and data warehouses and define the migration strategy to move existing data into the Big Data platform
  • Additional Attributes
  • Understanding of supply chain and operations processes including how analytics can improve COPQ
  • Ability to execute projects using an agile approach in a multi-disciplinary, matrixed environment
  • Comfortable working in a dynamic, research and development environment with several ongoing concurrent projects
  • Enjoys exploring and learning new technologies
  • Bachelor's degree in computer science, IT, engineering, or other relevant field with a minimum of 5 years of data management experience
  • Ability to identify data sources that will drive improvements in COPQ
  • 6+ years of experience in designing, deploying, and supporting data systems and solutions
  • 4+ years of experience in migrating data from data sources (MS SQL, Oracle, MySQL etc.)
  • 5+ years of experience in working with various database technologies and writing complex queries
  • 3 years of experience with Servigistics, Rapid Response, SAP (planning/procurement modules), Business Objects, Quality systems
  • Masters or PhD degrees in computer science, IT, engineering or relevant fields
  • Certification in Hadoop and other big data tools and technologies
  • Experience with open source data processing frameworks
  • Experience with supply chain
  • Experience with data management on public cloud hosting services
  • Experience with predictive analytics
  • Experience with Agile software development methodology
  • Ability to work in a fast-paced and ambiguous environment
  • 4+ years of experience in scripting languages (Perl, Python, Java etc)
23

Lead Data Engineer Resume Examples & Samples

  • Ability to translate user requirements and architecture patterns into defined BI deliverables
  • Drive creation of state of the art BI and analytics solutions to support GE Healthcare business
  • Own results of work performed together with your colleagues
  • Responsible for multiple programming languages, basic systems analysis techniques, testing, debugging, documentation standards, file design, storage, and interfacing
  • Maintains peer relationships across IT areas (infrastructure, operations, COEs, etc) to support effective implementations
  • Ability to execute multiple projects simultaneously
  • Ability to explain issues and resolutions to technical and non-technical staff
  • Previous IT programs development or design experience
  • PMP / ITIL certified
  • Experience with Teradata, ETL, ELT, Qlik, OBIEE, Spotfire, Tableau, Informatica, Hadoop
24

Lead Data Engineer Resume Examples & Samples

  • Implementation of test strategy for application under test such as Data Warehouse or Business Objectives
  • Data integration tools: Ab-Initio, Informatica or Data Stage
  • Commercial reporting tools including Business Objects or MicroStrategy
  • Software testing design and execution utilizing tracking software: JIRA and HP Quality Center
  • Hadoop Platform
Please mail resume to: A. Nelson, 151 Union St, San Francisco, CA 94111 and reference Req. # SS-10659
25

Lead Data Engineer Resume Examples & Samples

  • Understand how to build scalable, real time, streaming based, Big Data systems
  • Have developed and been a key influential member in a fully delivered data product
  • Lead the architecture and design of several modules related to the backend of a search system, a real time relevance engine, a system that computes several complex functions on the data on the fly, etc
  • Be a hands on developer and lead by example as a programmer
  • Provide guidance and contribute to coding standards
  • Be a leader who adopts a “performance and scale” czar role as required while understanding how to tradeoff performance and quality of output when required
  • Provide leadership in sprints, CI/CD and the DevOps efforts
26

Lead Data Engineer, Payformance Solutions Resume Examples & Samples

  • Excellent presentation and whiteboarding skills
  • BS (Masters or PhD preferred) degree in relevant disciplines: Bioinformatics, Medical Informatics, Healthcare Administration, Statistics, Applied Mathematics, Operations Research/ Optimization, Computer Science, Computational/ Theoretical Physics, Data Science, or Electrical/ Computer Engineering
  • 10+ years of experience working on Java applications, strong knowledge of Object-Oriented Programming, and experience with multiple databases. Candidates with less experience should be prepared to discuss projects that supplement experience
  • Self-motivated, enthusiastic, and a quick learner. You should have a broad base of software development experience, and be interested in continuing to grow technically via hands on experience and learning. Desire and ability to quickly learn new technologies on your own
  • Knowledge of, or willingness and aptitude to learn healthcare revenue cycle and claims data
  • Must have experience with
27

Lead Data Engineer Resume Examples & Samples

  • Accountable for all efforts in successfully delivering data engineering capability and delivers capabilities / services / solution based on stated data blueprint
  • Translates complex functional and technical requirements into detailed architecture, design, and high performing software
  • Leads data and batch/real-time analytical solutions leveraging transformational technologies
  • Works on multiple initiatives as a technical lead driving user story analysis and elaboration, design and development of software applications, testing, and builds automation tools
  • Selects data solution software and defines hardware requirements
  • Works with D&E team members to design and implement the data solutions in alignment with the initiative / product schedule
  • Creates strategies that use business analytics and data platforms
  • Builds and designs next-generation Data analytics framework developed on a group of core technologies
  • Implements security and recovery tools and techniques as required, works with developers to make sure that all data solutions are consistent
  • Creates data flow diagrams for all business systems and builds automation tools
  • Ensures all automated processes preserve data by managing the alignment of data availability and integration processes
  • Creates and maintains data catalog, interprets data results to business stakeholders and develops standards and processes for integration initiatives
  • Leads the design of the logical model and implements the physical database to support business needs
  • Designs key and indexing schemes and designs partitioning, constructs and implements operational data stores and data marts
  • Ensures database changes are reviewed and approved according to database design standards and principles
  • Resolves conflicts between models, ensuring that data models are consistent with the ecosystem model (e.g., entity names, relationships and definitions) and conducts Level 2 and 3 support
  • Leads or participates in creating, refining, managing and enforcing data management policies, procedures, conventions and standards
  • Contributes to the establishment of business continuity & disaster recovery requirements, methods and procedures for data systems and databases
  • Performs technology and product research to identify opportunities that impact business strategy, business requirements and performance
  • Evaluates and provides feedback on future technologies and new releases/upgrades
  • Bachelor's in computer science, computer engineering, or equivalent work experience
  • Data engineering, data science, or software engineering experience
  • Demonstrated experience leading teams of engineers
  • Capability to architect highly scalable distributed systems, using different open source tools
  • Demonstrated experience with agile or other rapid application development methods
  • Demonstrated experience with object-oriented design, coding and testing patterns as well as experience in engineering (commercial or open source) software platforms and large-scale data infrastructures
  • Understands how algorithms work and has experience building high-performance algorithms
  • Extensive knowledge in different programming or scripting languages
  • Expert knowledge of data modeling and understanding of different data structures and their benefits and limitations under particular use cases
  • Experience using Big Data batch and streaming tools
28

Lead Data Engineer Resume Examples & Samples

  • Transforms and aggregates large datasets using Spark and Scala
  • Engineers datasets for downstream modeling activities
  • Convert complex data into insights and action plans
  • Tests data for quality using several tools and languages
  • Uses SQL queries to transform data
  • Lead large scale projects that utilize online & offline data, structured & unstructured data, set top box data (media/behavioral/attitudinal) to build customer centric models and optimization tools
  • Dive into large, noisy, and complex real-world behavioral data to produce innovative analysis of historical patterns in customer behaviors and product performance
  • Mentor and train junior team members in advanced analytics techniques and new ideas
29

Lead Data Engineer Resume Examples & Samples

  • Lead the design of architecture and engineering best practices to build multi-tenant data infrastructure for DCPI
  • Develop engineering analytical solutions and program by following analytical models and solutions to help solve business problems
  • Provide engineering support for the usage and interpretation of data to various business partners
  • Perform engineering support for data-mining, deep learning and performance measurements on business-information systems utilizing analytics tools and methodologies
  • 5+ years of solid engineering background with SQL, Linux shell scripting, Java or Python
  • 5+ years of working experience with Hadoop/Spark/Hive
  • 3+ years solid background with data mining and/or machine learning
  • Strong analytical aptitude required
  • Passionate about working with large volumes of data
  • Knowledge of Hadoop, Hive, Spark and Pig in a Cloud environment preferred
30

Lead Data Engineer Resume Examples & Samples

  • Build data loading and transformation jobs using the Hortonworks toolset
  • Build data migration scripts to take all data stores in the cluster from one release to the next, without loss of data
  • The ability and willingness to create and maintain concise, accurate, readable, relevant documentation on our wiki (we use Confluence; knowledge of other source code control systems will be useful)
  • The discipline of working with a ticketing system
  • An understanding of file formats including csv, XML and JSON, as well as the related standards and technologies
  • Strong ETL / Data Warehouse skills
  • Experience with Jira and Confluence
  • Previous experience with Spark
  • Previous experience with Kafka and Attunity