Authors whose works are in public domain in at least one jurisdiction

List of works by Christian Bizer

Evolving the Web into a Global Data Space

book

Hyperlink Graph of the World Wide Web of 2012 (aggregated by first level subdomains)

Hyperlink Graph of the World Wide Web (aggregated by first-level subdomains) extracted from the Common Crawl Dataset 2012

Hyperlink Graph of the World Wide Web of 2012 (aggregated by host)

Hyperlink Graph of the World Wide Web (aggregated by host) extracted from the Common Crawl Dataset 2012

Hyperlink Graph of the World Wide Web of 2012 (aggregated by pay-level-domain)

Hyperlink Graph of the World Wide Web (aggregated by pay-level-domain) extracted from the Common Crawl Dataset 2012

Impact of the Characteristics of Multi-source Entity Matching Tasks on the Performance of Active Learning Methods

Linked Data - The Story So Far

2023 reprint of a 2009 article

Using ChatGPT for Entity Matching

Web Data Commons - Gold Standard for Feature Extraction

Gold standard containing manually labeled product entity features for the product categories: phones, televisions, and headphones.

Web Data Commons - Gold Standard for Product Matching

Gold standard containing manually labeled product entity correspondences for the product categories: phones, televisions, and headphones.

Web Data Commons - Product Data Corpus

Product Data Corpus for product matching and product feature extraction

Web Data Commons - RDFa, Microdata, and Microformat 2009/2010 Data Set

Microformat, Microdata and RDFa data from the 2009 Common Crawl web corpus

Web Data Commons - RDFa, Microdata, and Microformat August 2012 Data Set

Microformat, Microdata and RDFa data from the August 2012 Common Crawl web corpus

Web Data Commons - RDFa, Microdata, and Microformat December 2014 Data Set

Microformat, Microdata and RDFa data from the December 2014 Common Crawl web corpus

Web Data Commons - RDFa, Microdata, and Microformat November 2013 Data Set

Microformat, Microdata and RDFa data from the November 2013 Common Crawl web corpus

Web Data Commons - RDFa, Microdata, and Microformat November 2015 Data Set

Microformat, Microdata and RDFa data from the November 2015 Common Crawl web corpus

Web Data Commons - RDFa, Microdata, and Microformat October 2016 Data Set

Microformat, Microdata and RDFa data from the October 2016 Common Crawl web corpus

Web Data Commons - Web Table Corpus 2012 / English-language Relational Subset

English-language relational web table corpus extracted from the August 2012 Common Crawl

Web Data Commons - Web Table Corpus 2012 / Relational Data

Web relational table corpus extracted from the August 2012 Common Crawl

Web Data Commons - Web Table Corpus 2015

Web table corpus extracted from the July 2015 Common Crawl

Web Data Commons - Web Table Corpus 2015 / English-language Relational Subset

English-language relational web table corpus extracted from the July 2015 Common Crawl

Web Data Commons - Web Table Corpus 2015 / Relational Data

Web relational table corpus extracted from the July 2015 Common Crawl

Web Data Commons - WebIsA Database

Hypernymy relations extracted from the Common Crawl web corpus