List of works by Justin Zobel

A Living Lab Study of Query Amendment in Job Search

A categorical analysis of coreference resolution errors in biomedical texts

scientific article published on 26 February 2016

Accurate and robust genomic prediction of celiac disease using statistical learning

scientific article

Automated Detection of Records in Biological Sequence Databases that are Inconsistent with the Literature

Automated assessment of biological database assertions using the scientific literature

scientific article published on 29 April 2019

Automated detection of records in biological sequence databases that are inconsistent with the literature.

scientific article published on 14 June 2017

Bandage: interactive visualization of de novo genome assemblies

scientific article published on 22 June 2015

Benchmarks for Measurement of Duplicate Detection Methods in Nucleotide Databases

article

Benchmarks for measurement of duplicate detection methods in nucleotide databases

scientific article published on 8 January 2017

Boolean versus ranked querying for biomedical systematic reviews

scientific article

Cache-Conscious Collision Resolution in String Hash Tables

article by Nikolas Askitis & Justin Zobel published 2005 in Lecture Notes in Computer Science

Cache-conscious sorting of large sets of strings with dynamic tries

scientific article (publication date: 7 April 2005)

Capturing collection size for distributed non-cooperative retrieval

Compression of inverted indexes for fast query evaluation

article

Coreference resolution improves extraction of Biological Expression Language statements from texts

scientific article published on 3 July 2016

Design of an Efficient Out-of-Core Read Alignment Algorithm

Document Compaction for Efficient Query Biased Snippet Generation

article

Document Computing

Document Lifecycle

Duplicates, redundancies and inconsistencies in the primary nucleotide databases: a descriptive study

scientific article

Duplicates, redundancies, and inconsistencies in the primary nucleotide databases: a descriptive study

Evaluation of a Machine Learning Duplicate Detection Method for Bioinformatics Databases

scholarly article published 2015

Experiments in spoken document retrieval using phoneme n-grams

Exploring effective approaches for haplotype block phasing

scientific article published on 30 October 2019

Finding approximate matches in large lexicons

GeneMates: an R package for detecting horizontal gene co-transfer between bacteria using gene-gene associations controlled for population structure

scientific article published on 24 September 2020

Generation of Synthetic Query Auto Completion Logs

Gossamer--a resource-efficient de novo assembler

scientific article published on 18 May 2012

High performance computing enabling exhaustive analysis of higher order single nucleotide polymorphism interaction in Genome Wide Association Studies

scientific article

Inverted files for text search engines

scientific article (publication date: 25 July 2006)

Inverted files versus signature files for text indexing

article published in 1998

Iterative dictionary construction for compression of large DNA data sets

scientific article

Learning Biological Sequence Types Using the Literature

Literature Consistency of Bioinformatics Sequence Databases is Effective for Assessing Record Quality

Literature consistency of bioinformatics sequence databases is effective for assessing record quality.

scientific article

MedIR14

Methods for identifying versioned and plagiarized documents

scientific article (publication date: 2003)

Performance and robustness of penalized and unpenalized methods for genetic prediction of complex human disease

scientific article published on 30 November 2012

Prediction of breast cancer prognosis using gene set statistics provides signature stability and biological context

scientific article

Quality Matters: Biocuration Experts on the Impact of Duplication and Other Data Quality Issues in Biological Databases

scientific article published on 8 July 2020

Quantifying the impact of concept recognition on biomedical information retrieval

Query expansion using associated queries

Querying in a Large Hyperbase

Reference-Free Validation of Short Read Data

scientific article published on 22 September 2010

SRST2: Rapid genomic surveillance for public health and hospital microbiology labs

scientific article

Sample Sizes for Query Probing in Uncooperative Distributed Information Retrieval

article

Search Effectiveness in Nonredundant Sequence Databases: Assessments and Solutions

scientific article published on 24 December 2018

Short read sequence typing (SRST): multi-locus sequence types from short reads

scientific article

SparSNP: fast and memory-efficient analysis of all SNPs for phenotype prediction

scientific article

Supercomputing enabling exhaustive statistical analysis of genome wide association study data: Preliminary results

scientific article

Supervised Learning for Detection of Duplicates in Genomic Sequence Databases

scientific article

The Impact of Judgment Variability on the Consistency of Offline Effectiveness Measures

The MG retrieval system: compressing for space and speed

The challenge of high recall in biomedical systematic search

Using query logs to establish vocabularies in distributed information retrieval

Visualizing search results and document collections using topic maps

scholarly article by David Newman et al., published July 2010 in Journal of Web Semantics