book
Hyperlink Graph of the World Wide Web (aggregated by first level subdomains) extracted from the Common Crawl Dataset 2012
Hyperlink Graph of the World Wide Web (aggregated by host) extracted from the Common Crawl Dataset 2012
Hyperlink Graph of the World Wide Web (aggregated by pay-level-domain) extracted from the Common Crawl Dataset 2012
2023 reprint of a 2009 article
Gold standard containing manually labeled product entity features for the product categories: phones, televisions, and headphones.
Gold standard containing manually labeled product entity correspondences for the product categories: phones, televisions, and headphones.
Product Data Corpus for product matching and product feature extraction
Microformat, Microdata and RDFa data from the 2009 Common Crawl web corpus
Microformat, Microdata and RDFa data from the August 2012 Common Crawl web corpus
Microformat, Microdata and RDFa data from the December 2014 Common Crawl web corpus
Microformat, Microdata and RDFa data from the November 2013 Common Crawl web corpus
Microformat, Microdata and RDFa data from the November 2015 Common Crawl web corpus
Microformat, Microdata and RDFa data from the October 2016 Common Crawl web corpus
Web relational English table corpus extracted from the August 2012 Common Crawl
Web relational table corpus extracted from the August 2012 Common Crawl
Web table corpus extracted from the July 2015 Common Crawl
Web relational English table corpus extracted from the July 2015 Common Crawl
Web relational table corpus extracted from the July 2015 Common Crawl
Hypernymy relations extracted from the Common Crawl web corpus