Associate Director, Data Curator
Location: Macclesfield, UK or Gothenburg, SE (3 days on-site)
About AstraZeneca
AstraZeneca is a global, science-led, patient-focused biopharmaceutical company that focuses on the discovery, development and commercialisation of prescription medicines for some of the world’s most serious diseases. But we’re more than one of the world’s leading pharmaceutical companies. At AstraZeneca, we are pioneering new frontiers by identifying and treating patients earlier, working towards the aim of eliminating cancer as a cause of death.
Come and join our AZ team where you will play a pivotal role in this exciting period of development!
Are you ready to make a significant impact in the world of pharmaceuticals? We are on the hunt for a Associate Director, Data Curator to join our dynamic Pharmaceutical Technology & Development (PT&D) Data Governance team. In this pivotal role, you'll be responsible for interpreting and representing complex biological and medical data, crafting controlled vocabularies, and adhering to structured data models. Your work will enable interoperable data and enhance insights into the diverse data types crucial for pharmaceutical product development. Collaborate closely with PT&D business units and fellow Data Governance team members to drive improvements in data quality. Are you up for the challenge?
Key Responsibilities:
Assess and analyse demand from business teams to capture and develop controlled terminology within a cohesive data harmonization strategy.
Analyse and review datasets for meaning and context, ensuring curated datasets meet business needs.
Perform data profiling and data quality assessment.
Collaborate with a wide range of business and technical collaborators.
Explain and promote data curation concepts to business stakeholders.
Collaborate with business data specialists, including scientists and domain experts.
Collaborate with technical data specialists, including information architects, data modelers, ontology specialists, and knowledge graph experts.
Design, implement, and maintain controlled vocabularies; support the development of taxonomies and ontologies for PT&D.
Generate high-value, enriched data assets through in-depth curation of selected data assets.
Arrange data in a structured manner to facilitate easy access, integration, and analysis.
Implement data curation standards and best practices, including FAIR data principles (Findable, Accessible, Interoperable, Reusable).
Ensure high data quality and consistency by implementing quality checks and validation rules for ontology-driven systems and processes.
Maintain and update data repositories to ensure data accuracy and availability.
Use data curation tools such as Progress Semaphore to design, build, and maintain controlled vocabularies and collections.
Work with Data Governance team members to ensure policy and governance related to data standards implementation and curation are planned for and aligned with.
Continuously improve implementation processes and tools related to data standards and curation; make scientific data FAIR and AI/Analytics-ready.
Support data analytics and reporting efforts by providing curated datasets.
Deliver training and support to increase the utilization of data curation tools and platforms.
Adapt to future changes due to the introduction of automated tools and large language models (LLMs) and AI automation, anticipating significant evolution in responsibilities over the coming years.
Essential requirements:
Relevant degree in Life Sciences, Information/Data Science, Computer Science, Informatics, IT, or other related field
2-3 years of experience in data curation and management, ideally in the healthcare space
Operational understanding of common data models, data standards, vocabularies, taxonomies, ontologies etc., and their implementation within a centralized repository
Experience working with business and IT partners in the implementation of data standardization and curation projects
Ability to learn new tools and technologies
Experience in creating system and user documentation artifacts related to data standards implementation & curation
Proficient with: Semaphore (or similar), JIRA, Confluence and familiar with Collibra, Protege, Github
Data Quality tools: Acceldata
Proven interpersonal and communication skills to translate, promote and embed Data Governance and Data Standards ensuring buy-in across a large scale and diverse organisation
Problem-Solving: Proficiency in identifying issues, analysing situations, and developing effective solutions within the scope of data policy and governance
Experience in working in multi-skilled, multi-location data teams, working to agile principles
Excellent written and verbal communication, and consultancy skills proven strong Stakeholder Management skills and ability to influence
Desirables:
Experience with clinical data curation and familiarity with industry standards
Domain data understanding: the structure, provenance, and meaning of the source data crucial to the domain
Enterprise data management: Understanding of the concepts of data governance, data quality, and data architecture in a large, complex organization
Technology knowledge: Data technologies, including databases, data lakes, and data warehousing solutions, and tools for data visualization and reporting
Project management
In Office Requirement:
When we put unexpected teams in the same room, we unleash bold thinking with the power to inspire life-changing medicines. In-person working gives us the platform we need to connect, work at pace and challenge perceptions. That's why we work, on average, a minimum of three days per week from the office. But that doesn't mean we're not flexible. We balance the expectation of being in the office while respecting individual flexibility. Join us in our unique and ambitious world.
Competitive salary and benefits package on offer.