Workshop on Text Mining and Generation (TMG) @ KI 2022
- September 19, 2022
- Co-located with KI 2022
- Virtual (Hosted in Trier, Germany)
Digital text data is available in large amounts and different granularities. Typical sources include social media posts, books, news articles, web pages or company reports, etc. A major challenge this text data imposes is that it is unstructured and must first be processed to make further analysis possible. At the same time, there are also many situations in which only structured data is available that is to be verbally explained—for instance, by Explainable AI. These contrasting scenarios lead to two complementary application areas: text mining and text generation. The aim of text mining is to analyze the content of unstructured text and extract (useful) structured information. In contrast, text generation attempts to (automatically) create text from structured information or knowledge that is for example stored in large language models. The goal of the TMG workshop is to bring these two perspectives together by eliciting research paper submissions that aim for bridging the gap between knowledge extraction and text generation. Since recent approaches to text mining and text generation are predominantly based on artificial intelligence (AI) methodologies, KI 2022 is a relevant venue to bring together AI researchers working on these two tasks.
The proceedings are available at CEUR.
|10:10||11:45||Session 1: Original Papers|
|13:30||14:30||Keynote by Iryna Gurevych|
|14:45||15:45||Session 2: Invited Talks|
|15:45||16:15||Closing and Open Panel Discussion|
Call for Papers
We welcome any submissions that deal with transforming the representation of data using techniques of natural language processing (NLP): (applied) research papers, theoretical papers, user studies or prospective papers. Topics include, but are not limited to, the following:
- Answer generation for question answering.
- Argument mining.
- Ethical aspects of AI for text generation (e.g., mitigating bias, misinformation, etc.).
- Generating descriptions for graph-based workflows.
- Generating explanations in recommender systems.
- Graph-to-text generation for knowledge graphs.
- Methods for Explainable AI.
- Information extraction.
- Knowledge graph refinement, particularly featuring text-based signals.
- Parsing argumentative structures in texts.
- Pattern detection in log files.
- Snippet generation for search results.
- Workflow mining.
The submission of the papers should be in accordance to the GI-LNI style and have to be submitted via EasyChair (please select the track W6: Text Mining and Generation). Authors can submit three different types of papers:
- Full Paper (up to 12 pages, excluding references)
- Short Paper (up to 6 pages, excluding references)
- Extended Abstract (up to 3 pages, excluding references)
If selected for publication in the GI-LNI proceedings, authors will later get a possibility to submit an extended version of the paper disregarding their original submission format. The workshop is running a single-blind review process.
|August 1, 2022||Submission Due|
|August 31, 2022||Author Notification|
|September 11, 2022||Camera-Ready Version|
|September 19, 2022||Workshop Date|
Keynote by Prof. Dr. Iryna Gurevych
Iryna Gurevych (PhD 2003, U. Duisburg-Essen, Germany) is professor of Computer Science and director of the Ubiquitous Knowledge Processing (UKP) Lab at the Technical University (TU) of Darmstadt in Germany. Her main research interests are in machine learning for large-scale language understanding and text semantics. Iryna's work has received numerous awards. Examples are the ACL fellow award 2020 and the first Hessian LOEWE Distinguished Chair award (2,5 mil. Euro) in 2021. Iryna is co-director of the NLP program within ELLIS, a European network of excellence in machine learning. She is currently the vice-president of the Association of Computational Linguistics.
Detect—Verify—Communicate: Combating Misinformation with More Realistic NLP
Dealing with misinformation is a grand challenge of the information society directed at equipping the computer users with effective tools for identifying and debunking misinformation. Current Natural Language Processing (NLP) including its fact-checking research fails to meet the expectations of real-life scenarios. In this talk, we show why the past work on fact-checking has not yet led to truly useful tools for managing misinformation, and discuss our ongoing work on more realistic solutions. NLP systems are expensive in terms of financial cost, computation, and manpower needed to create data for the learning process. With that in mind, we are pursuing research on detection of emerging misinformation topics to focus human attention on the most harmful, novel examples. Automatic methods for claim verification rely on large, high-quality datasets. To this end, we have constructed two corpora for fact checking, considering larger evidence documents and pushing the state of the art closer to the reality of combating misinformation. We further compare the capabilities of automatic, NLP-based approaches to what human fact checkers actually do, uncovering critical research directions for the future. To edify false beliefs, we are collaborating with cognitive scientists and psychologists to automatically detect and respond to attitudes of vaccine hesitancy, encouraging anti-vaxxers to change their minds with effective communication strategies.
Organizing and Program Committee
Martin Luther University Halle-Wittenberg
Prof. Dr. Adrian Ulges
RheinMain University of Applied Sciences
Prof. Dr. Stephanie Evert
Prof. Dr. Lutz Schröder
Prof. Dr. Achim Rettinger
Prof. Dr. Martin Vogt
Trier University of Applied Sciences