This job has expired, please see additional jobs below
Content Extraction Specialist
Dow Jones
Princeton, NJ, United States
Job Details - this job has expired, please see similar jobs below
About Us
The Strategic Services team provides key strategic functions across Data Strategy, including Metadata Management, Business Management, Content Strategy, Quality and Solutions. The Strategic Services group houses all non-research capabilities within Data Strategy.
The Metadata Management team is responsible for the creation and development of Dow Jones Intelligent Identifiers -- our proprietary taxonomy of more than 3,000 industry, news subject and region tags -- and the application of DJID tags to articles in 28 languages to high quality standards of precision and recall. It also includes a small Extraction team involved in text mining.
Role
The Content Extraction Specialist reports to the Manager, Extraction, based in Princeton. The individual will be part of Strategic Services’ Metadata Management team and will also share responsibilities with the Autocoding Specialists in Metadata Management.
The Content Extraction Specialist works on various extraction projects using Text Analytics tools to extract "facts" or "events" from Factiva and Dow Jones content. As part of the Metadata Management team they will also master the Dow Jones Intelligent Identifiers and Factiva’s auto-categorization systems.
Responsibilities
• Research, produce and maintain expert rules for data extraction using third-party Text Analytics tools.
• Participate in code application to Factiva content based on DJ proprietary taxonomy.
• Develop and maintain document training sets for linguistics-based extraction and categorization tools.
• Test software releases as extraction and categorization tools are improved.
• Perform extraction and coding quality monitoring of publications, and refine extraction and categorization rules to ensure the highest levels of precision and recall.
• Undertake research and analysis to improve the overall enrichment of Factiva content.
• Troubleshoot extraction and coding problems.
• Answer quality issues questions escalated via internal quality monitoring workflows.
• Perform special ad-hoc projects as needed to support the needs of the business.
Skills & Experience
• Library science, linguistics, research or journalism background.
• Experience with taxonomies and classification systems a plus.
• Extremely detailed oriented
• Internet savvy.
• Excellent research and analytical skills.
• Understanding of Boolean search logic a plus.
• Strong organizational and time management skills.
• Advanced skills in logical reasoning.
• Proficiency in Microsoft Office applications.
• Understanding of Dow Jones Intelligent Identifiers
• Experience with Text Analytics products helpful.
• Native or Proficiency level in English language. West European language - a plus.
Dow Jones, Making Careers Newsworthy
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, protected veteran status, or disability status. EEO/AA/M/F/Disabled/Vets.
Dow Jones is committed to providing reasonable accommodation for qualified individuals with disabilities, in our job application and/or interview process. If you need assistance or accommodation in completing your application, due to a disability, please reach out to us via email. Please put “Reasonable Accommodation" in the subject line.
Business Area:
DATA STRATEGY