IOW Logo

News

Can AI help make sequence databases more accessible?

Group photo of the participants in a workshop organized as part of an NFDI4Biodiversity network activity on the topic “AI Tools for Metadata Extraction in Sequence Databases”
Participants in the NFDI4Biodiversity network activity workshop on “AI Tools for Metadata Extraction in Sequence Databases.” There is also lively online participation. (Photo: IOW / K. Beck)

On June 25 and 26, 2026, the IOW will host the first workshop of an NFDI4Biodiversity topic table on the subject of “AI Tools for Metadata Extraction in Sequence Databases.” Around 30 experts in research data management, science and business from 17 institutions in Germany and beyond have come together to discuss how metadata in sequence data archives can be made more readily searchable, comparable, and easier to use for specific research questions in the future through the use of artificial intelligence (AI).

Sequence data – digital information about genetic material in samples – is a key resource for today's biodiversity research. These data are generated on a large scale worldwide and made available in public repositories. However, their scientific utilization by researchers who build upon this work depends crucially on whether they are linked to meaningful, standardized, and machine-readable metadata – that is, descriptive supplementary information. Often, this information – such as details on sample origin, environmental conditions, analytical methods, or research context – is either incomplete or available only as free-text data that are difficult to analyse. AI-supported methods are now expected to open up new possibilities for systematically extracting and structuring such information and for sustainably enhancing existing datasets.

The workshop, which kicked off today at the IOW, brings together all stakeholders in the field of sequence data research: scientists who generate these data and those who wish to reuse them, specialists in research data management, and AI experts specializing in data extraction. The focus is on questions such as which tools and approaches already exist, what developments will be needed in the future, and how challenges such as quality assurance, compatibility, and long-term implementation can be addressed collaboratively. Guest speakers from Wismar, Leipzig, and Hinxton, UK, will provide additional input for the discussion.

The workshop is organized and hosted by IOW researcher and bioinformatics expert Christiane Hassenrück. “With this workshop, we don’t just want to discuss individual AI tools; we want to lay the groundwork for coordinated development,” says Hassenrück. “If we succeed in bringing together expertise in data management, AI development, and research practice, we can make sequence data archives significantly more usable in the long term – and thereby also strengthen biodiversity research.”

The workshop marks the start of a series of activities within the NFDI4Biodiversity topic table on metadata extraction from sequence databases. Over the coming year, the results of this meeting and two additional workshops will be compiled into a white paper. The goal is to define common standards, identify strategic development needs, and outline clear steps for sustainable metadata improvement in sequence databases.

NFDI4Biodiversity is a consortium under the umbrella of the nonprofit National Research Data Infrastructure (NFDI) association (www.nfdi.de/?lang=en) dedicated to the collaborative use of biodiversity and environmental data. The network brings together the scientific and technical expertise of approximately 60 partner institutions to make data more accessible, reliable, and reusable for research, nature conservation, and policy-making.

Lead / Contact Persons for the NFDI4Biodiversity Network Activity “AI Tools for Metadata Extraction in Sequence Databases”:
Prof. Dr. Birgit Gemeinholzer, University of Kassel: Birgnullit.Gemeinholzer@uni-kassel.de
Dr. Christiane Hassenrück, Leibniz Institute for Baltic Sea Research Warnemünde: chrinullstiane.hassenrueck@iow.de
Dr. Stephanie Jurburg, Helmholtz Centre for Environmental Research: stepnullhanie.jurburg@ufz.de

Go back