Publications

Library and Information Science | Publications

Publications

The publications of the Information Processing and Analysis group are listed in BibSonomy and embedded here:

2025

Advances and Challenges in the Automatic Identification of Indirect Quotations in Scholarly Texts and Literary Works.
In: Proceedings of the 5th Workshop on Natural Language Processing for Digital Humanities. 2025.
Frederik Arnold, Robert Jäschke and Philip Kraut.
[BibTeX]

A Benchmark and Robustness Study of In-Context-Learning with Large Language Models in Music Entity Detection .
In: Proceedings of the 31st International Conference on Computational Linguistics, series COLING'25, pages 9845–9859. Association for Computational Linguistics, 2025.
Simon Hachmeier and Robert Jäschke.
[doi] [abstract] [BibTeX]

On the Robustness of Cover Version Identification Models: A Study Using Cover Versions from YouTube.
In: Proceedings of the iConference . Bloomington, IN, 2025.
Simon Hachmeier and Robert Jäschke.
[BibTeX]

2024

Bridging the Analytics Gap: Optimizing Content Performance using Actionable Knowledge Discovery.
In: Proceedings of the 35th ACM Conference on Hypertext and Social Media, series HT '24, pages 185–192. Association for Computing Machinery, New York, NY, USA, 2024.
Tom Alby.
[doi] [abstract] [BibTeX]

Information extraction of music entities in conversational music queries.
In: Proceedings of the 3rd Workshop on NLP for Music and Audio, series NLP4MusA'24, pages 37-42. Association for Computational Lingustics, 2024.
Simon Hachmeier and Robert Jäschke.
[doi] [abstract] [BibTeX]

Leveraging User-Generated Metadata of Online Videos for Cover Song Identification.
In: Proceedings of the 3rd Workshop on NLP for Music and Audio, series NLP4MusA'24, pages 43-48. Association for Computational Lingustics, 2024.
Simon Hachmeier and Robert Jäschke.
[doi] [abstract] [BibTeX]

A Repository for Formal Contexts.
In: I. P. Cabrera, S. Ferré and S. Obiedkov, editors, Conceptual Knowledge Structures, series Lecture Notes in Artificial Intelligence, pages 182-197. Springer Nature Switzerland, Cham, 2024.
Tom Hanika and Robert Jäschke.
[abstract] [BibTeX]

Literatur im Wikiversum – Eine praktische Annäherung über API-Abfragen und Wikipedia-Metriken.
In: Konferenzabstracts der DHd 2024, pages 49-53. 2024.
Viktor Illmer, Bart Soethaert, Lilly Welz, Frank Fischer and Robert Jäschke.
[doi] [abstract] [BibTeX]

2023

Popular, but Hardly Used: Has Google Analytics Been to the Detriment of Web Analytics?.
In: Proceedings of the 15th ACM Web Science Conference 2023, series WebSci '23, pages 304–311. Association for Computing Machinery, New York, NY, USA, 2023.
Tom Alby.
[doi] [abstract] [BibTeX]

A Novel Approach for Identification and Linking of Short Quotations in Scholarly Texts and Literary Works.
Journal of Computational Literary Studies, 2(1), 2023.
Frederik Arnold and Robert Jäschke.
[doi] [abstract] [BibTeX]

Ein Quantum Literatur. Empirische Daten zu einer Theorie des literarischen Textumfangs.
In: F. Jannidis, editor, Digitale Literaturwissenschaft, pages 777-812. J.B. Metzler, Stuttgart, 2023.
Frank Fischer and Robert Jäschke.
[doi] [abstract] [BibTeX]

Preface: World Literature in an Expanding Digital Space.
Journal of Cultural Analytics, 8(2), 2023.
Frank Fischer, Jacob Blakesley, Paula Wojcik and Robert Jäschke.
[doi] [abstract] [BibTeX]

Cover Song Identification in Practice with Multimodal Co-Training.
In: M. Leyer and J. Wichmann, editors, Proceedings of the Conference on ``Lernen, Wissen, Daten, Analysen'', series CEUR Workshop Proceedings, pages 359-371. Aachen, 2023.
Simon Hachmeier and Robert Jäschke.
[doi] [abstract] [BibTeX]

Graph-Based Representation and Reasoning.
Lecture Notes in Computer Science. volume 14133. Springer, Cham, 2023.
Manuel Ojeda-Aciego, Kai Sauerwald and Robert Jäschke.
[doi] [abstract] [BibTeX]

Annotated Vossian Antonomasia Dataset.
2023.
Michel Schwab, Robert Jäschke and Frank Fischer.
[doi] [abstract] [BibTeX]

»Die Greta Garbo der Leichtathletik« – Eine systematische Analyse der Modifier vossianischer Antonomasien mithilfe von Word Embeddings.
In: Proceedings of the DHd, series DHd'23. 2023.
Michel Schwab and Frank Fischer.
[doi] [BibTeX]

»Japan’s Answer to Mozart«: Automatic Detection of Generalized Patterns of Vossian Antonomasia.
In: M. Abbas and A. A. Freihat, editors, Proceedings of the 6th International Conference on Natural Language and Speech Processing, series ICNLSP'23, pages 99-109. Association for Computational Linguistics, 2023.
Michel Schwab, Robert Jäschke and Frank Fischer.
[doi] [abstract] [BibTeX]

»Who is the Madonna of Italian-American Literature?«: Extracting and Analyzing Target Entities of Vossian Antonomasia.
In: Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pages 110-115. Association for Computational Linguistics, 2023.
Michel Schwab, Robert Jäschke and Frank Fischer.
[doi] [abstract] [BibTeX]

2022

Analyzing the Web: Are Top Websites Lists a Good Choice for Research?.
In: Proceedings of the International Conference on Theory and Practice of Digital Libraries, series TPDL '22, pages 11-25. Springer, Cham, 2022.
Tom Alby and Robert Jäschke.
[doi] [abstract] [BibTeX]

A Game with Complex Rules: Literature References in Literary Studies .
In: Proceedings of the Workshop Understanding LIterature references in academic full TExt at JCDL 2022, volume 3220, series ULITE-ws '22, pages 7-15. CEUR Workshop Proceedings, 2022.
Frederik Arnold and Robert Jäschke.
[doi] [abstract] [BibTeX]

Salience in Literary Texts: A Combined Approach to the Relevance of Passages.
In: DH2022. 2022.
Frederik Arnold, Benjamin Fiechter, Evelyn Gius, Robert Jäschke, Steffen Martus and Michael Vauth.
[BibTeX]

Graph-Based Representation and Reasoning: Proceedings of the 27th International Conference on Conceptual Structures.
Lecture Notes in Computer Science. volume 13403. Springer Cham, 2022.
Tanya Braun, Diana Cristea and Robert Jäschke.
[doi] [abstract] [BibTeX]

Music Version Retrieval from YouTube: How to Formulate Effective Search Queries?.
In: P. Reuss, V. Eisenstadt, J. Schönborn and J. Schäfer, editors, Proceedings of the Conference on ``Lernen, Wissen, Daten, Analysen'', series CEUR Workshop Proceedings, pages 213-226. Aachen, 2022.
Simon Hachmeier, Robert Jäschke and Hadi Saadatdoorabi.
[doi] [abstract] [BibTeX]

»Der Frank Sinatra der Wettervorhersage« – Cross-Lingual Vossian Antonomasia Extraction.
In: Proceedings of the 5th International Conference on Natural Language and Speech Processing, series ICNLSP'22, pages 282-287. Association for Computational Linguistics, 2022.
Michel Schwab, Robert Jäschke and Frank Fischer.
[doi] [abstract] [BibTeX]

»The Rodney Dangerfield of Stylistic Devices« – End-to-End Detection and Extraction of Vossian Antonomasia Using Neural Networks.
Frontiers in Artificial Intelligence , 5, 2022.
Michel Schwab, Robert Jäschke and Frank Fischer.
[doi] [abstract] [BibTeX]

Where are the Datasets? A case study on the German Academic Web Archive.
In: Proceedings of the Web Archiving and Digital Libraries Workshop at JCDL 2022. 2022.
Yousef Younes, Sebastian Tiesler, Robert Jäschke and Brigitte Mathiak.
[abstract] [BibTeX]

2021

Lotte and Annette: A Framework for Finding and Exploring Key Passages in Literary Works.
In: Proceedings of the Workshop on Natural Language Processing for Digital Humanities at ICON 2021, pages 55-63. NLP Association of India, 2021.
Frederik Arnold and Robert Jäschke.
[doi] [BibTeX]

Proximity dimensions and the emergence of collaboration: a HypTrails study on German AI research.
Scientometrics, 126:9847-9868, 2021.
Tobias Koopmann, Maximilian Stubbemann, Matthias Kapa, Michael Paris, Guido Buenstorf, Tom Hanika, Andreas Hotho, Robert Jäschke and Gerd Stumme.
[doi] [abstract] [BibTeX]

Evaluating dataset creation heuristics for concept detection in web pages using BERT.
In: Proceedings of the 14th International Conference on Knowledge Science, Engineering and Management, volume 12816, series Lecture Notes in Artificial Intelligence, pages 1-14. Springer, 2021.
Michael Paris and Robert Jäschke.
[abstract] [BibTeX]

2020

To Follow Or To Unfollow: Motives For The Academic Use Of Twitter.
In: Proceedings of the 14th International Technology, Education and Development Conference, series INTED, pages 1009-1018. IATED, 2020.
S.B. Linek, C.P. Hoffmann and R. Jäschke.
[doi] [abstract] [BibTeX]

Twitter appears to be a popular social media service for academics, especially computer scientists. While some studies have begun to examine motives for academic Twitter use, little is known about academics’ considerations for following and unfollowing other users. Our empirical study explored general motives for the academic use of the social media platform Twitter. Based on the uses and gratifications theory and prior research as well as a review of existing scales, we designed a detailed questionnaire on motives for the academic use of Twitter. Besides the general motives for the academic use of Twitter we also analyzed subjective considerations for following and unfollowing accounts. The latter questions aimed at deeper insights in the networking behavior on Twitter and a better understanding of the adoption of social media in academia and their potential influence on the research process. The online survey was presented to 54 computer scientists that were active on Twitter. Results show that academic Twitter use is generally characterized by information motives as well as by various social considerations. As the main reasons for using Twitter, we identify dissemination and, to a lesser degree, collection of information. However, users are also motivated by community development considerations. Accordingly, when following an account, users do not only look for content that is informative, interesting, of high quality, and current. They also tend to follow an account whose owner shares similar research interests, is an important researcher in the field, and that is personally known and liked. Unfollowing, while rather ubiquitous, is largely driven by considerations of content. To summarize, we find that academics subjective considerations oscillates between content and personal aspects, with content aspects driving usage, but personal aspects also shaping following decisions. These insights contribute to the current state of research on motives of academic Twitter usage finding that information and community development motives play central roles in the ensuing communication behavior and structures. Although previous studies have found that academic hierarchies are replicated in online social networking structures, our findings imply that this influence may be mediated by information considerations: wishing to collect helpful information on Twitter, academics tend to follow well-known colleagues in the field. However, the results of our survey suggest that the academic status of an account owner per se is not an important factor in following decisions. As this study focused on computer scientists on Twitter, it is an open question if and to what extend the findings are valid for other disciplines and other social media. A more comprehensive analysis involving other disciplines and also the simultaneous use of various social media would provide a more holistic view of the academic use of social media.

How to Assess the Exhaustiveness of Longitudinal Web Archives.
In: Proceedings of the 31st ACM Conference on Hypertext and Social Media. ACM, 2020.
Michael Paris and Robert Jäschke.
[doi] [BibTeX]

How to Assess the Exhaustiveness of Longitudinal Web Archives: A Case Study of the German Academic Web.
In: Proceedings of the 31st ACM Conference on Hypertext and Social Media, series HT ’20. ACM, New York, NY, USA, 2020.
Michael Paris and Robert Jäschke.
[abstract] [BibTeX]

2019

»The Michael Jordan of greatness« – Extracting Vossian antonomasia from two decades of The New York Times, 1987–2007.
Digital Scholarship in the Humanities, 35(1):34–42, 2019.
Frank Fischer and Robert Jäschke.
[doi] [abstract] [BibTeX]

Proceedings of the Conference on "Lernen, Wissen, Daten, Analysen".
CEUR Workshop Proceedings. number 2454. Aachen, 2019.
Robert Jäschke and Matthias Weidlich.
[doi] [BibTeX]

»A Buster Keaton of Linguistics« – First Automated Approaches for the Extraction of Vossian Antonomasia.
2019.
Michel Schwab, Robert Jäschke, Frank Fischer and Jannik Strötgen.
[doi] [BibTeX]

»A Buster Keaton of Linguistics« – First Automated Approaches for the Extraction of Vossian Antonomasia.
In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, series EMNLP '19, pages 6239-6244. Association for Computational Linguistics, 2019.
Michel Schwab, Robert Jäschke, Frank Fischer and Jannik Strötgen.
[doi] [abstract] [BibTeX]

2018

Proceedings of the International Workshop on Bias in Information, Algorithms, and Systems (BIAS).
CEUR Workshop Proceedings. number 2103. Aachen, 2018.
Jo Bates, Paul D. Clough, Robert Jäschke and Jahna Otterbacher.
[doi] [BibTeX]

Towards Bias Detection in Online Text Corpora.
In: International Workshop on Bias in Information, Algorithms, and Systems (BIAS), series CEUR Workshop Proceedings, pages 19-23. Aachen, 2018.
Christoph Hube, Besnik Fetahu and Robert Jäschke.
[doi] [BibTeX]

Datenschätze selber heben: Data Science und Bibliotheken.
2018.
Robert Jäschke.
[doi] [abstract] [BibTeX]

Liebe und Tod in der Deutschen Nationalbibliothek: Der DNB-Katalog als Forschungsobjekt der digitalen Literaturwissenschaft .
In: Konferenzabstracts der DHd 2018, series DHd'18, pages 261-266. 2018.
Robert Jäschke and Frank Fischer.
[doi] [BibTeX]

»Der Henry Ford des Computerzeitalters« – Ein Vossanto-Memory.
2018.
Robert Jäschke and Frank Fischer.
[doi] [abstract] [BibTeX]

2017

World Literature According to Wikipedia: Introduction to a DBpedia-Based Framework.
2017.
Christoph Hube, Frank Fischer, Robert Jäschke, Gerhard Lauer and Mads Rosendahl Thomsen.
[doi] [abstract] [BibTeX]

New media, familiar dynamics: academic hierarchies influence academics' following behaviour on Twitter.
2017.
Robert Jäschke, Stephanie B. Linek and Christian P. Hoffmann.
[doi] [abstract] [BibTeX]

»Der Helmut Kohl unter den Brotaufstrichen« – Zur Extraktion Vossianischer Antonomasien aus großen Zeitungskorpora.
In: Proceedings of the DHd 2017, series DHd '17, pages 120-124. 2017.
Robert Jäschke, Jannik Strötgen, Elena Krotova and Frank Fischer.
[BibTeX]

It’s all about information? The Following Behaviour of Professors and PhD Students on Twitter.
The Journal of Web Science, 3(1):1-15, 2017.
Stephanie Linek, Asmelash Teka Hadgu, Christian Pieter Hoffmann, Robert Jäschke and Cornelius Puschmann.
[doi] [abstract] [BibTeX]

What do computer scientists tweet? Analyzing the link-sharing practice on Twitter.
PLoS ONE, 12(6), 2017.
Marco Schmitt and Robert Jäschke.
[doi] [abstract] [BibTeX]

2016

Tweeting in times of exposure: A mixed-methods approach for exploring patterns of communication related to business scandals on Twitter.
In: Proceedings of the Workshop on Natural Language Processing and Computational Social Science, series NLP+CSS at WebSci. Hannover, Germany, 2016.
Jens Bergmann, Asmelash Teka Hadgu and Robert Jäschke.
[abstract] [BibTeX]

Cäsar Flaischlens Graphische Litteratur-Tafel digital.
Poster at 3rd DHA conference. 2016.
Ingo Börner, Frank Fischer, Angelika Hechtl, Robert Jäschke and Peer Trilcke.
[doi] [BibTeX]

The Role of Cores in Recommender Benchmarking for Social Bookmarking Systems.
Transactions on Intelligent Systems and Technology, 7(3):40:1-40:33, 2016.
Stephan Doerfel, Robert Jäschke and Gerd Stumme.
[doi] [abstract] [BibTeX]

Telling English Tweets Apart: the Case of US, GB, AU.
In: Proceedings of the Workshop on Natural Language Processing and Computational Social Science, series NLP+CSS at WebSci. Hannover, Germany, 2016.
Asmelash Teka Hadgu, Netaya Lotze and Robert Jäschke.
[abstract] [BibTeX]

The 8th ACM Web Science Conference 2016.
SIGWEB Newsletter(Summer):1:1-1:7, 2016.
Robert Jäschke.
[doi] [abstract] [BibTeX]

You Shall Not Pass: Detecting Malicious Users at Registration Time.
In: Proceedings of the 1st International Workshop on Online Safety, Trust and Fraud Prevention, series OnSt '16, pages 2:1-2:6. ACM, New York, NY, USA, 2016.
Christian Kater and Robert Jäschke.
[doi] [abstract] [BibTeX]

Posted, Visited, Exported: Altmetrics in the Social Tagging System BibSonomy.
Journal of Informetrics, 10(3):732-749, 2016.
Daniel Zoller, Stephan Doerfel, Robert Jäschke, Gerd Stumme and Andreas Hotho.
[doi] [abstract] [BibTeX]

2015

Social Activity versus Academic Activity: A Case Study of Computer Scientists on Twitter.
In: Proceedings of the 15th International Conference on Knowledge Technologies and Data-driven Business, series i-KNOW '15. ACM, New York, NY, USA, 2015.
Subhash Chandra Pujari, Asmelash Teka Hadgu, Elisabeth Lex and Robert Jäschke.
[doi] [abstract] [BibTeX]

Semantic Annotation for Microblog Topics Using Wikipedia Temporal Information.
In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 97-106. Association for Computational Linguistics, 2015.
Tuan Tran, Nam-Khanh Tran, Asmelash Teka Hadgu and Robert Jäschke.
[doi] [abstract] [BibTeX]

On Publication Usage in a Social Bookmarking System.
In: Proceedings of the ACM Web Science Conference, series WebSci '15, pages 67:1-67:2. ACM, New York, NY, USA, 2015.
Daniel Zoller, Stephan Doerfel, Robert Jäschke, Gerd Stumme and Andreas Hotho.
[doi] [abstract] [BibTeX]

2014

Literatur recherchieren und verwalten.
In: CoScience - Gemeinsam forschen und publizieren mit dem Netz, chapter 1, pages 12-20. Technische Informationsbibliothek, Hannover, 2014.
Ina Blümel, Christian Hauschke and Robert Jäschke.
[doi] [abstract] [BibTeX]

The Quest for Research Information.
In: Proceedings of the 12th International Conference on Current Research Information Systems. 2014.
Ina Blümel, Stefan Dietze, Lambert Heller, Robert Jäschke and Martin Mehlberg.
[doi] [abstract] [BibTeX]

UMAP 2014 Extended Proceedings.
volume 1181. CEUR-WS, 2014.
Iván Cantador, Min Chi, Rosta Farzan and Robert Jäschke.
[doi] [abstract] [BibTeX]

Proceedings of the ECML PKDD Discovery Challenge 2013 - Recommending Given Names.
CEUR-WS.org. volume 1120. 2014.
Stephan Doerfel, Andreas Hotho, Robert Jäschke, Folke Mitzlaff and Juergen Mueller.
[doi] [BibTeX]

Identifying and Analyzing Researchers on Twitter.
In: Proceedings of the 2014 ACM Conference on Web Science, series WebSci '14, pages 23-30. ACM, New York, NY, USA, 2014.
Asmelash Teka Hadgu and Robert Jäschke.
[doi] [abstract] [BibTeX]

Graph-Based Representation and Reasoning.
Lecture Notes in Computer Science. volume 8577. Springer, 2014.
Nathalie Hernandez, Robert Jäschke and Madalina Croitoru.
[doi] [abstract] [BibTeX]

2013

An analysis of tag-recommender evaluation procedures.
In: Proceedings of the 7th ACM conference on Recommender systems, series RecSys '13, pages 343-346. ACM, New York, NY, USA, 2013.
Stephan Doerfel and Robert Jäschke.
[doi] [abstract] [BibTeX]

Attribute Exploration on the Web.
In: P. Cellier, F. Distel and B. Ganter, editors, Contributions to the 11th International Conference on Formal Concept Analysis, pages 19-34. 2013.
Robert Jäschke and Sebastian Rudolph.
[doi] [abstract] [BibTeX]

Deeper Into the Folksonomy Graph: FolkRank Adaptations and Extensions for Improved Tag Recommendations.
cs.IR, 1310.1498, 2013.
Nikolas Landia, Stephan Doerfel, Robert Jäschke, Sarabjot Singh Anand, Andreas Hotho and Nathan Griffiths.
[doi] [abstract] [BibTeX]

2012

Recommender Systems for Social Tagging Systems.
2012.
L. Balby Marinho, A. Hotho, R. Jäschke, A. Nanopoulos, S. Rendle, L. Schmidt-Thieme, G. Stumme and P. Symeonidis.
[doi] [abstract] [BibTeX]

Leveraging Publication Metadata and Social Data into FolkRank for Scientific Publication Recommendation .
In: Proceedings of the 4th ACM RecSys workshop on Recommender systems and the social web, pages 9-16. ACM, New York, NY, USA, 2012.
Stephan Doerfel, Robert Jäschke, Andreas Hotho and Gerd Stumme.
[doi] [abstract] [BibTeX]

Publication Analysis of the Formal Concept Analysis Community.
In: F. Domenach, D. Ignatov and J. Poelmans, editors, Formal Concept Analysis, volume 7278, series Lecture Notes in Artificial Intelligence, pages 77-95. Springer, Berlin/Heidelberg, 2012.
Stephan Doerfel, Robert Jäschke and Gerd Stumme.
[doi] [abstract] [BibTeX]

Challenges in Tag Recommendations for Collaborative Tagging Systems.
In: J. J. Pazos Arias, A. Fernández Vilas and R. P. Díaz Redondo, editors, Recommender Systems for the Social Web, pages 65-87. Springer, Berlin/Heidelberg, 2012.
Robert Jäschke, Andreas Hotho, Folke Mitzlaff and Gerd Stumme.
[doi] [abstract] [BibTeX]

Extending FolkRank with Content Data.
In: Proceedings of the 4th ACM RecSys workshop on Recommender systems and the social web, pages 1-8. ACM, New York, NY, USA, 2012.
Nikolas Landia, Sarabjot Singh Anand, Andreas Hotho, Robert Jäschke, Stephan Doerfel and Folke Mitzlaff.
[doi] [abstract] [BibTeX]

2011

Enhancing Social Interactions at Conferences.
Information Technology, 53(3):101-107, 2011.
Martin Atzmueller, Dominik Benz, Stephan Doerfel, Andreas Hotho, Robert Jäschke, Bjoern Elmar Macek, Folke Mitzlaff, Christoph Scholz and Gerd Stumme.
[doi] [abstract] [BibTeX]

Social Tagging Recommender Systems.
In: F. Ricci, L. Rokach, B. Shapira and P. B. Kantor, editors, Recommender Systems Handbook, pages 615-644. Springer, New York, 2011.
Leandro Balby Marinho, Alexandros Nanopoulos, Lars Schmidt-Thieme, Robert Jäschke, Andreas Hotho, Gerd Stumme and Panagiotis Symeonidis.
[doi] [abstract] [BibTeX]

Recommendation in the Social Web.
AI Magazine, 32(3):46-56, 2011.
Robin Burke, Jonathan Gemmell, Andreas Hotho and Robert Jäschke.
[doi] [abstract] [BibTeX]

A Comparison of Content-Based Tag Recommendations in Folksonomy Systems.
In: K. E. Wolff, D. E. Palchunov, N. G. Zagoruiko and U. Andelfinger, editors, Knowledge Processing and Data Analysis, volume 6581, series Lecture Notes in Computer Science, pages 136-149. Springer, Berlin/Heidelberg, 2011.
Jens Illig, Andreas Hotho, Robert Jäschke and Gerd Stumme.
[doi] [abstract] [BibTeX]

Formal Concept Analysis and Tag Recommendations in Collaborative Tagging Systems.
2011.
Robert Jäschke.
[doi] [abstract] [BibTeX]

One of the most noticeable innovation that emerged with the advent of the Web 2.0 and the focal point of this thesis are collaborative tagging systems. They allow users to annotate arbitrary resources with freely chosen keywords, so called tags. The tags are used for navigation, finding resources, and serendipitous browsing and thus provide an immediate benefit for the user. By now, several systems for tagging photos, web links, publication references, videos, etc. have attracted millions of users which in turn annotated countless resources. Tagging gained so much popularity that it spread into other applications like web browsers, software packet managers, and even file systems. Therefore, the relevance of the methods presented in this thesis goes beyond the Web 2.0. The conceptual structure underlying collaborative tagging systems is called folksonomy. It can be represented as a tripartite hypergraph with user, tag, and resource nodes. Each edge of the graph expresses the fact that a user annotated a resource with a tag. This social network constitutes a lightweight conceptual structure that is not formalized, but rather implicit and thus needs to be extracted with knowledge discovery methods. In this thesis a new data mining task – the mining of all frequent tri-concepts – is presented, together with an efficient algorithm for discovering such implicit shared conceptualizations. Our approach extends the data mining task of discovering all closed itemsets to three-dimensional data structures to allow for mining folksonomies. Extending the theory of triadic Formal Concept Analysis, we provide a formal definition of the problem, and present an efficient algorithm for its solution. We show the applicability of our approach on three large real-world examples and thereby perform a conceptual clustering of two collaborative tagging systems. Finally, we introduce neighborhoods of triadic concepts as basis for a lightweight visualization of tri-lattices. The social bookmark and publication sharing system BibSonomy, which is currently among the three most popular systems of its kind, has been developed by our research group. Besides being a useful tool for many scientists, it provides interested researchers a basis for the evaluation and integration of their knowledge discovery methods. This thesis introduces BibSonomy as an exemplary collaborative tagging system and gives an overview of its architecture and some of its features. Furthermore, BibSonomy is used as foundation for evaluating and integrating some of the discussed approaches. Collaborative tagging systems usually include tag recommendation mechanisms easing the process of finding good tags for a resource, but also consolidating the tag vocabulary across users. In this thesis we evaluate and compare several recommendation algorithms on large-scale real-world datasets: an adaptation of user-based Collaborative Filtering, a graph-based recommender built on top of the FolkRank algorithm, and simple methods based on counting tag co-occurences. We show that both FolkRank and Collaborative Filtering provide better results than non-personalized baseline methods. Moreover, since methods based on counting tag co-occurrences are computationally cheap, and thus usually preferable for real time scenarios, we discuss simple approaches for improving the performance of such methods. We demonstrate how a simple recommender based on counting tags from users and resources can perform almost as good as the best recommender. Furthermore, we show how to integrate recommendation methods into a real tagging system, record and evaluate their performance by describing the tag recommendation framework we developed for BibSonomy. With the intention to develop, test, and evaluate recommendation algorithms and supporting cooperation with researchers, we designed the framework to be easily extensible, open for a variety of methods, and usable independent from BibSonomy. We also present an evaluation of the framework which demonstrates its power. The folksonomy graph shows specific structural properties that explain its growth and the possibility of serendipitous exploration. Clicklogs of web search engines can be represented as a folksonomy in which queries are descriptions of clicked URLs. The resulting network structure, which we will term logsonomy is very similar to the one of folksonomies. In order to find out about its properties, we analyze the topological characteristics of the tripartite hypergraph of queries, users and bookmarks on a large folksonomy snapshot and on query logs of two large search engines. We find that all of the three datasets exhibit similar structural properties and thus conclude that the clicking behaviour of search engine users based on the displayed search results and the tagging behaviour of collaborative tagging users is driven by similar dynamics. In this thesis we further transfer the folksonomy paradigm to the Social Semantic Desktop – a new model of computer desktop that uses Semantic Web technologies to better link information items. There we apply community support methods to the folksonomy found in the network of social semantic desktops. Thus, we connect knowledge discovery for folksonomies with semantic technologies. Alltogether, the research in this thesis is centered around collaborative tagging systems and their underlying datastructure – folksonomies – and thereby paves the way for the further dissemination of this successful knowledge management paradigm.

Tagging data as implicit feedback for learning-to-rank.
In: Proceedings of the ACM WebSci Conference, pages 1-4. New York, NY, USA, 2011.
Beate Navarro Bullock, Robert Jäschke and Andreas Hotho.
[doi] [abstract] [BibTeX]

Formal Concept Analysis.
Lecture Notes in Artificial Intelligence. volume 6628. Springer, Berlin/Heidelberg, 2011.
Petko Valtchev and Robert Jäschke.
[doi] [abstract] [BibTeX]

The present volume features a selection of the papers presented at the 9th International Conference on Formal Concept Analysis (ICFCA 2011). Over the years, the ICFCA conference series has grown into the premier forum for dissemination of research on topics from formal concept analysis (FCA) theory and applications, as well as from the related fields of lattices and partially ordered structures. FCA is a multi-disciplinary field with strong roots in the mathematical theory of partial orders and lattices, with tools originating in computer science and artificial intelligence. FCA emerged in the early 1980s from efforts to restructure lattice theory to promote better communication between lattice theorists and potential users of lattice-based methods for data management. Initially, the central theme was the mathematical formalization of concept and conceptual hierarchy. Since then, the field has developed into a constantly growing research area in its own right with a thriving theoretical community and an increasing number of applications in data and knowledge processing including disciplines such as data visualization, information retrieval, machine learning, software engineering, data analysis, data mining, social networks analysis, etc. ICFCA 2011 was held from May 2 to May 6, 2011, in Nicosia, Cyprus. The program committee received 49 high-quality submissions that were subjected to a highly competitive selection process. Each paper was reviewed by three referees (exceptionally two or four). After a first round, some papers got a definitive acceptance status, while others got accepted conditionally to improvements in their content. The latter got to a second round of reviewing. The overall outcome was the acceptance of 16 papers as regular ones for presentation at the conference and publication in this volume. Another seven papers have still been assessed as valuable for discussion at the conference and were therefore collected in the supplementary proceedings. The regular papers presented hereafter cover advances on a wide range of subjects from FCA and related fields. A first group of papers tackled mathematical problems within the FCA field. A subset thereof focused on factor identification within the incidence relation or its lattice representation (papers by Glodeanu and by Krupka). The remainder of the group proposed characterizations of particular classes of ordered structures (papers by Doerfel and by Meschke et al.). A second group of papers addressed algorithmic problems from FCA and related fields. Two papers approached their problems from an algorithmic complexity viewpoint (papers by Distel and by Babin and Kuznetsov) while the final paper in this group addressed algorithmic problems for general lattices, i.e., not represented as formal contexts, with an FCA-based approach (work by Balcázar and Tîrnăucă). A third group studied alternative approaches for extending the expressive power of the core FCA, e.g., by generalizing the standard one-valued attributes to attributes valued in algebraic rings (work by GonzÃ¡lez Calabozo et al.), by introducing pointer-like attributes, a.k.a. links (paper by Kötters), or by substituting set-shaped concept intents with modal logic expressions (paper by Soldano and Ventos). A fourth group focused on data mining-oriented aspects of FCA: agreement lattices in structured data mining (paper by Nedjar et al.), triadic association rule mining (work by Missaoui and Kwuida) and bi-clustering of numerical data (Kaytoue et al.). An addional paper shed some initial light on a key aspect of FCA-based data analysis and mining, i.e., the filtering of interesting concepts (paper by Belohlavek and Macko). Finally, a set of exciting applications of both basic and enhanced FCA frameworks to practical problems have beed described: in analysis of gene expression data (the already mentioned work by GonzÃ¡lez Calabozo et al.), in web services composition (paper by Azmeh et al.) and in browsing and retrieval of structured data (work by Wray and Eklund). This volume also contains three keynote papers submitted by the invited speakers of the conference. All these contributions constitute a volume of high quality which is the result of the hard work done by the authors, the invited speakers and the reviewers. We therefore wish to thank the members of the Program Committee and of the Editorial Board whose steady involvement and professionalism helped a lot. We would also like to acknowledge the participation of all the external reviewers who sent many valuable comments. Kudos also go to EasyChair for having made the reviewing/editing process a real pleasure. Special thanks go to the Cyprus Tourism Organisation for sponsoring the conference and to the University of Nicosia for hosting it. Finally we wish to thank the Conference Chair Florent Domenach and his colleagues from the Organization Committee for the mountains of energy they put behind the conference organization process right from the beginning in order to make it a total success. We would also like to express our gratitude towards Dr. Peristianis, President of the University of Nicosia, for his personal support.

2010

Academic Publication Management with PUMA - collect, organize and share publications.
In: M. Lalmas, J. Jose, A. Rauber, F. Sebastiani and I. Frommholz, editors, Proceedings of the European Conference on Research and Advanced Technology for Digital Libraries (ECDL) 2010, volume 6273, series Lecture Notes in Computer Science, pages 417-420. Springer, Berlin/Heidelberg, 2010.
Dominik Benz, Andreas Hotho, Robert Jäschke, Gerd Stumme, Axel Halle, Angela Gerlach Sanches Lima, Helge Steenweg and Sven Stefani.
[doi] [abstract] [BibTeX]

Query Logs as Folksonomies.
Datenbank-Spektrum, 10(1):15-24, 2010.
Dominik Benz, Andreas Hotho, Robert Jäschke, Beate Krause and Gerd Stumme.
[doi] [abstract] [BibTeX]

The Social Bookmark and Publication Management System BibSonomy.
The VLDB Journal, 19(6):849-875, 2010.
Dominik Benz, Andreas Hotho, Robert Jäschke, Beate Krause, Folke Mitzlaff, Christoph Schmitz and Gerd Stumme.
[doi] [abstract] [BibTeX]

Publikationsmanagement mit BibSonomy - ein Social-Bookmarking-System für Wissenschaftler.
HMD - Praxis der Wirtschaftsinformatik, 271:47-58, 2010.
Andreas Hotho, Dominik Benz, Folke Eisterlehner, Robert Jäschke, Beate Krause, Christoph Schmitz and Gerd Stumme.
[doi] [abstract] [BibTeX]

2009

Managing publications and bookmarks with BibSonomy.
In: C. Cattuto, G. Ruffo and F. Menczer, editors, HT '09: Proceedings of the 20th ACM Conference on Hypertext and Hypermedia, pages 323-324. ACM, New York, NY, USA, 2009.
Dominik Benz, Folke Eisterlehner, Andreas Hotho, Robert Jäschke, Beate Krause and Gerd Stumme.
[doi] [abstract] [BibTeX]

ECML PKDD Discovery Challenge 2009 (DC09).
CEUR-WS.org. volume 497. 2009.
Folke Eisterlehner, Andreas Hotho and Robert Jäschke.
[doi] [BibTeX]

Social Bookmarking am Beispiel BibSonomy.
In: A. Blumauer and T. Pellegrini, editors, Social Semantic Web, chapter 18, pages 363-391. Springer, Berlin, Heidelberg, 2009.
Andreas Hotho, Robert Jäschke, Dominik Benz, Miranda Grahl, Beate Krause, Christoph Schmitz and Gerd Stumme.
[doi] [abstract] [BibTeX]

Testing and Evaluating Tag Recommenders in a Live System.
In: RecSys '09: Proceedings of the third ACM Conference on Recommender Systems, pages 369-372. ACM, New York, NY, USA, 2009.
Robert Jäschke, Folke Eisterlehner, Andreas Hotho and Gerd Stumme.
[doi] [abstract] [BibTeX]

Testing and Evaluating Tag Recommenders in a Live System.
In: D. Benz and F. Janssen, editors, Workshop on Knowledge Discovery, Data Mining, and Machine Learning, pages 44-51. 2009.
Robert Jäschke, Folke Eisterlehner, Andreas Hotho and Gerd Stumme.
[doi] [abstract] [BibTeX]

Mapping Bibliographic Records with Bibliographic Hash Keys.
In: R. Kuhlen, editor, Information: Droge, Ware oder Commons?, series Proceedings of the ISI. Verlag Werner Hülsbusch, 2009.
Jakob Voss, Andreas Hotho and Robert Jäschke.
[doi] [abstract] [BibTeX]

2008

Analyzing Tag Semantics Across Collaborative Tagging Systems.
In: H. Alani, S. Staab and G. Stumme, editors, Social Web Communities, series Dagstuhl Seminar Proceedings. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, Dagstuhl, Germany, 2008.
Dominik Benz, Marko Grobelnik, Andreas Hotho, Robert Jäschke, Dunja Mladenic, Vito D. P. Servedio, Sergej Sizov and Martin Szomszor.
[doi] [abstract] [BibTeX]

Discovering Shared Conceptualizations in Folksonomies.
Journal of Web Semantics, 6(1):38-53, 2008.
Robert Jäschke, Andreas Hotho, Christoph Schmitz, Bernhard Ganter and Gerd Stumme.
[doi] [abstract] [BibTeX]

Logsonomy - A Search Engine Folksonomy.
In: Proceedings of the Second International Conference on Weblogs and Social Media (ICWSM 2008), pages 192-193. AAAI Press, Menlo Park, CA, USA, 2008.
Robert Jäschke, Beate Krause, Andreas Hotho and Gerd Stumme.
[doi] [abstract] [BibTeX]

Tag Recommendations in Social Bookmarking Systems.
AI Communications, 21(4):231-247, 2008.
Robert Jäschke, Leandro Marinho, Andreas Hotho, Lars Schmidt-Thieme and Gerd Stumme.
[doi] [abstract] [BibTeX]

Logsonomy - Social Information Retrieval with Logdata.
In: HT '08: Proceedings of the Nineteenth ACM Conference on Hypertext and Hypermedia, pages 157-166. ACM, New York, NY, USA, 2008.
Beate Krause, Robert Jäschke, Andreas Hotho and Gerd Stumme.
[doi] [abstract] [BibTeX]

2007

Analysis of the Publication Sharing Behaviour in BibSonomy.
In: U. Priss, S. Polovina and R. Hill, editors, Proceedings of the 15th International Conference on Conceptual Structures (ICCS 2007), volume 4604, series Lecture Notes in Artificial Intelligence, pages 283-295. Springer-Verlag, Berlin/Heidelberg, 2007.
Robert Jäschke, Andreas Hotho, Christoph Schmitz and Gerd Stumme.
[doi] [abstract] [BibTeX]

Organizing Publications and Bookmarks in BibSonomy.
In: H. Alani, N. Noy, G. Stumme, P. Mika, Y. Sure and D. Vrandecic, editors, Workshop on Social and Collaborative Construction of Structured Knowledge (CKC 2007) at WWW 2007. Banff, Canada, 2007.
Robert Jäschke, Miranda Grahl, Andreas Hotho, Beate Krause, Christoph Schmitz and Gerd Stumme.
[doi] [BibTeX]

Tag Recommendations in Folksonomies.
In: A. Hinneburg, editor, Workshop Proceedings of Lernen - Wissensentdeckung - Adaptivität (LWA 2007), pages 13-20. Martin-Luther-Universität Halle-Wittenberg, 2007.
Robert Jäschke, Leandro Marinho, Andreas Hotho, Lars Schmidt-Thieme and Gerd Stumme.
[doi] [abstract] [BibTeX]

Tag Recommendations in Folksonomies.
In: J. N. Kok, J. Koronacki, R. L. de Mántaras, S. Matwin, D. Mladenic and A. Skowron, editors, Knowledge Discovery in Databases: PKDD 2007, 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, volume 4702, series Lecture Notes in Computer Science, pages 506-514. Springer, Berlin, Heidelberg, 2007.
Robert Jäschke, Leandro Balby Marinho, Andreas Hotho, Lars Schmidt-Thieme and Gerd Stumme.
[doi] [abstract] [BibTeX]

2006

Semantic Network Analysis of Ontologies.
In: Y. Sure and J. Domingue, editors, The Semantic Web: Research and Applications, volume 4011, series Lecture Notes in Computer Science, pages 514-529. Springer, Berlin/Heidelberg, 2006. 10.1007/11762256_38
Bettina Hoser, Andreas Hotho, Robert Jäschke, Christoph Schmitz and Gerd Stumme.
[doi] [abstract] [BibTeX]

BibSonomy: A Social Bookmark and Publication Sharing System.
In: A. de Moor, S. Polovina and H. Delugach, editors, Proceedings of the Conceptual Structures Tool Interoperability Workshop at the 14th International Conference on Conceptual Structures. Aalborg University Press, Aalborg, Denmark, 2006.
Andreas Hotho, Robert Jäschke, Christoph Schmitz and Gerd Stumme.
[doi] [BibTeX]

Emergent Semantics in BibSonomy.
In: C. Hochberger and R. Liskowsky, editors, Informatik 2006 - Informatik für Menschen, volume 94, series Lecture Notes in Informatics, pages 305-312. Gesellschaft für Informatik, Bonn, 2006.
Andreas Hotho, Robert Jäschke, Christoph Schmitz and Gerd Stumme.
[doi] [abstract] [BibTeX]

Information Retrieval in Folksonomies: Search and Ranking.
In: Y. Sure and J. Domingue, editors, The Semantic Web: Research and Applications, volume 4011, series Lecture Notes in Computer Science, pages 411-426. Springer, Berlin/Heidelberg, 2006.
Andreas Hotho, Robert Jäschke, Christoph Schmitz and Gerd Stumme.
[doi] [abstract] [BibTeX]

Trend Detection in Folksonomies.
In: Y. S. Avrithis, Y. Kompatsiaris, S. Staab and N. E. O'Connor, editors, Proc. First International Conference on Semantics And Digital Media Technology (SAMT) , volume 4306, series Lecture Notes in Computer Science, pages 56-70. Springer, Heidelberg, 2006.
Andreas Hotho, Robert Jäschke, Christoph Schmitz and Gerd Stumme.
[doi] [abstract] [BibTeX]

TRIAS - An Algorithm for Mining Iceberg Tri-Lattices.
In: ICDM '06: Proceedings of the Sixth International Conference on Data Mining, pages 907-911. IEEE Computer Society, Washington, DC, USA, 2006.
Robert Jäschke, Andreas Hotho, Christoph Schmitz, Bernhard Ganter and Gerd Stumme.
[doi] [abstract] [BibTeX]

Wege zur Entdeckung von Communities in Folksonomies.
In: S. Braß and A. Hinneburg, editors, Proc. 18. Workshop Grundlagen von Datenbanken, pages 80-84. Martin-Luther-Universität , Halle-Wittenberg, 2006.
Robert Jäschke, Andreas Hotho, Christoph Schmitz and Gerd Stumme.
[doi] [abstract] [BibTeX]

Content Aggregation on Knowledge Bases using Graph Clustering.
In: Y. Sure and J. Domingue, editors, The Semantic Web: Research and Applications, volume 4011, series Lecture Notes in Computer Science, pages 530-544. Springer, Berlin/Heidelberg, 2006.
Christoph Schmitz, Andreas Hotho, Robert Jäschke and Gerd Stumme.
[doi] [abstract] [BibTeX]

Kollaboratives Wissensmanagement.
2006.
Christoph Schmitz, Andreas Hotho, Robert Jäschke and Gerd Stumme.
[doi] [abstract] [BibTeX]

Mining Association Rules in Folksonomies.
In: V. Batagelj, H.-H. Bock, A. Ferligoj and A. Žiberna, editors, Data Science and Classification, series Studies in Classification, Data Analysis, and Knowledge Organization, pages 261-270. Springer, Berlin/Heidelberg, 2006.
Christoph Schmitz, Andreas Hotho, Robert Jäschke and Gerd Stumme.
[doi] [abstract] [BibTeX]

2005

Die Struktur der Monoide binärer Relationen auf endlichen Mengen.
Master's thesis (Diplomarbeit), Technische Universität Dresden, 2005.
Robert Jäschke.
[doi] [BibTeX]

Humboldt-Universität zu Berlin - Berlin School of Library and Information Science