Frühjahrstreffen der FG Datenbanken

Start: 2024-03-11T13:00:00+01:00
End: 2024-03-12T13:00:00+01:00
Location: Universitätshauptgebäude

Europe/Berlin timezone

Contact

birgitta.koenig-ries@uni-jena.de

Session

Talks

Mar 11, 2024, 1:00 PM

HS 024 (Universitätshauptgebäude)

Fürstengraben 1 07743 Jena

28. Welcome

Birgitta König-Ries (Heinz Nixdorf Chair for Distributed Information Systems)

3/11/24, 1:00 PM
9. On the Path to a Quality Indicator for Software and Data Publications for the Helmholtz

Marcel Meistring (Helmholtz Open Science Office)

3/11/24, 1:15 PM

Talk

Research data and software publications have become a regular output of scientific work. Yet unlike more traditional text publications, widely established processes to assess and evaluate their quality are still missing. This prevents researchers from getting the credit they deserve, as common performance indicators often simply omit this part of their scientific contributions.
As part of...
13. From theory to practice - Advancing Research Assessment for Incentives at Charité and BIH through infrastructure

Miriam Kip (BIH Charité)

3/11/24, 2:00 PM

Talk

There is a gap between current responsible research and innovation (RRI) and open science (OS) practices on the one hand and assessment practices on the other. While research practices and their ways of publication and dissemination have diversified, assessment practices have remained narrow, focusing on criteria of publication quantity and reputation. In my talk, I will discuss two projects. The first project...
8. Terminologies in database systems

Felix Engel (TIB)

3/11/24, 3:00 PM

Talk

The use of commonly agreed terminologies is an elementary component of database systems. They affect data consistency, querying and retrieval, and interoperability. Creating, searching for, and agreeing on a terminology to be used is a non-trivial problem, as it requires specialised knowledge and coordination processes. This presentation introduces the terminology service that deals...
7. Medax - a knowledge graph for biomedicine

Judith Wodke (U Greifswald)

3/11/24, 3:30 PM

Talk

Within the MeDaX project we study bioMedical Data eXploration using graph technologies. We design and implement efficient concepts and tools for integration, enrichment, scoring, retrieval, and analysis of biomedical data. Interested in data similarity and quality measures, we initiated an international community project for biomedical provenance standardisation and cooperate within the...
6. Schema Evolution in Research Data

Tanja Auge (U Regensburg)

3/11/24, 4:00 PM

Talk

Changes occur frequently, especially in data-driven long-term studies. Changing databases lead to the accumulation of many schemas and instances over time. However, any scientific application must be able to reconstruct the historical data to ensure the reproducibility or at least the explainability of the research results. A method is needed that allows each database version to be easily...
14. Democratising data analysis with Galaxy

Björn Grüning

3/11/24, 5:00 PM

Talk

Galaxy is an open-source platform that allows researchers to analyze and share scientific data using interoperable APIs and various user-friendly web-based interfaces. The Galaxy project was launched in 2005 and has since become a powerful tool for researchers across a wide range of research fields, including *omics, biodiversity, machine learning, cheminformatics, NLP, material science,...
5. From Research Data Management to Data Platforms: A Hugging Face Approach

Michael Gertz (U Heidelberg)

3/11/24, 5:45 PM

Talk

Does research data management as we know it in the context of database research or data science need platforms like Hugging Face? Or are platforms and services such as Kaggle or GESIS sufficient? In this talk, after giving a brief overview of the core features of Hugging Face, we claim that the data research community would benefit a lot from a platform similar to Hugging Face, in...
23. Snowflake Berlin

Dirk Junghanns (Snowflake)

3/11/24, 6:15 PM

Talk

The talk briefly introduces Snowflake and outlines the database challenges we are currently working on. The Snowflake Academia program is also briefly presented.
16. Problems and Issues in Biodiversity Data Infrastructures

Bernhard Seeger (U Marburg)

3/12/24, 9:00 AM

Talk

The current biodiversity crisis has triggered an urgent need for a better understanding of the network of life on Earth. Efficient data management is crucial in biodiversity research and is the backbone for a digital twin of past, present, and future life. The Research Data Commons (RDC) is the central cloud-based information system architecture of NFDI4Biodiversity, the consortium of the NFDI...
29. Flashtalks

3/12/24, 9:30 AM

One-minute teasers presenting the posters
10. Tabular Data Synthesis for Data Management

Fabian Panse (HPI)

3/12/24, 11:15 AM

Talk

The problem of generating synthetic data is almost as old as modern research itself. However, with the advent of generative AI, new possibilities for synthesizing tabular data have emerged that go far beyond the capabilities of traditional statistical or rule-based approaches. Most of this new research comes from the ML community, where ML models need to be fed with useful training data. Since...
15. Exploring Computational Reproducibility in Jupyter Notebooks: Insights and Challenges

Sheeba Samuel (Friedrich Schiller University)

3/12/24, 12:00 PM

Talk

Reproducible research emphasizes the importance of documenting and publishing scientific results in a manner that enables others to verify and extend them. In this talk, we explore computational reproducibility within the context of Jupyter notebooks, presenting insights and challenges from our study. We will present the key steps of the pipeline we used for assessing the reproducibility of...
11. Research Data Management in TIRA for Reproducible Shared Tasks

Maik Fröbe (U Jena)

3/12/24, 12:30 PM

Talk

TIRA is a platform to organize shared tasks with software submissions, mostly in information retrieval and natural language processing. Due to the software submissions, TIRA allows blinded experimentation on (confidential) datasets to which participants have no access. After a shared task, the artifacts of the shared tasks, i.e., research data in the form of submitted software, inputs,...