Search filters

List of works by Geoff Webb

$$\text {ALR}^n$$ ALR n : accelerated higher-order logistic regression

A Comparative Study of Bandwidth Choice in Kernel Density Estimation for Naive Bayesian Classification

A Data Scientist's Guide to Start-Ups

scientific article

A Fast Trust-Region Newton Method for Softmax Logistic Regression

scholarly article published 9 June 2017

A Multiple Test Correction for Streams and Cascades of Statistical Hypothesis Tests

scientific article

A New Restricted Bayesian Network Classifier

A Statistically Efficient and Scalable Method for Log-Linear Analysis of High-Dimensional Data

Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features.

scientific article

Accurate parameter estimation for Bayesian network classifiers using hierarchical Dirichlet processes

An accurate and fully-automated ensemble model for weekly time series forecasting

scientific article published in 2023

Anytime classification for a pool of instances

article by Bei Hui et al published 2 July 2009 in Machine Learning

Association discovery

Bioinformatic approaches for predicting substrates of proteases

scientific article published on February 2011

Cascleave: towards more accurate prediction of caspase substrate cleavage sites

scientific article published on 3 February 2010

Cell graph neural networks enable the precise prediction of patient survival in gastric cancer

scientific article published in 2022

Characterizing concept drift

Comprehensive assessment and performance improvement of effector protein predictors for bacterial secretion systems III, IV and VI.

scientific article

Contrary to Popular Belief Incremental Discretization can be Sound, Computationally Efficient and Extremely Useful for Streaming Data

Cost-sensitive specialization

Critical evaluation of bioinformatics tools for the prediction of protein crystallization propensity

scientific article published on 27 February 2017

Critical evaluation of bioinformatics tools for the prediction of protein crystallization propensity.

scientific article published on 22 June 2017

Crysalis: an integrated server for computational analysis and design of protein crystallization.

scientific article

Designing a more efficient, effective and safe Medical Emergency Team (MET) service using data analysis.

scientific article published on 27 December 2017

Directed Graphs

Discovering Significant Patterns

Discovering significant patterns

Discovery of amino acid motifs for thrombin cleavage and validation using a model substrate

scientific article

Discretization Methods

Discretization for naive-Bayes learning: managing discretization bias and variance

article

Dynamic Time Warping Averaging of Time Series Allows Faster and More Accurate Classification

EGM: encapsulated gene-by-gene matching to identify gene orthologs and homologous segments in genomes

scientific article published on 27 June 2010

Editorial

Efficient Discovery of the Most Interesting Associations

Efficient and Effective Accelerated Hierarchical Higher-Order Logistic Regression for Large Data Quantities

Efficient large-scale protein sequence comparison and gene matching to identify orthologs and co-orthologs

scientific article published on December 30, 2011

Efficient parameter learning of Bayesian network classifiers

Efficient search for association rules

Ensemble Selection for SuperParent-One-Dependence Estimators

Extremely Fast Decision Tree

Fast and Effective Single Pass Bayesian Learning

Faster and more accurate classification of time series by exploiting a novel dynamic time warping averaging algorithm

Feature Based Modelling: A methodology for producing coherent, consistent, dynamically changing models of agents' competencies

Feature-subspace aggregating: ensembles for stable and unstable learners

Filtered-top-k association discovery

GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features

scientific article published on 06 October 2016

GlycoMine: a machine learning-based approach for predicting N-, C- and O-linked glycosylation in the human proteome

scientific article published on 6 January 2015

GraphormerDTI: A graph transformer-based approach for drug-target interaction prediction

scientific article published in 2024

Highly Scalable Attribute Selection for Averaged One-Dependence Estimators

Incremental Discretization for Naïve-Bayes Classifier

Indexing and classifying gigabytes of time series under time warping

Inducing diagnostic rules for glomerular disease with the DLG machine learning algorithm

scholarly article by Geoff Webb & John W.M Agar published December 1992 in Artificial Intelligence in Medicine

Integrating machine learning with knowledge acquisition through direct interaction with domain experts

scholarly article by Geoff Webb published June 1996 in Knowledge-Based Systems

Introduction: special issue of selected papers of ACML 2013

K-Optimal Rule Discovery

Knowledge-transfer learning for prediction of matrix metalloprotease substrate-cleavage sites

scientific article

Large-scale comparative assessment of computational predictors for lysine post-translational modification sites

Layered critical values: a powerful direct-adjustment approach to discovering significant patterns

Learning by extrapolation from marginal to full-multivariate probability distributions: decreasingly naive Bayesian classification

Learning crew scheduling constraints from historical schedules

Machine Learning for User Modeling

scholarly article

MetalExplorer, a Bioinformatics Tool for the Improved Prediction of Eight Types of Metal-Binding Sites Using a Random Forest Algorithm with Two- Step Feature Selection

Mining significant association rules from uncertain data

Mining significant crisp-fuzzy spatial association rules

Naive-Bayes Inspired Effective Pre-Conditioner for Speeding-Up Logistic Regression

Non-Disjoint Discretization for Aggregating One-Dependence Estimator Classifiers

Not So Naive Bayes: Aggregating One-Dependence Estimators

article published in 2005

On detecting differences between groups

On the Application of ROC Analysis to Predict Classification Performance Under Varying Class Distributions

POSSUM: a bioinformatics toolkit for generating numerical sequence feature descriptors based on PSSM profiles.

scientific article published in September 2017

PREvaIL, an integrative approach for inferring catalytic residues using sequence, structural, and network features in a machine-learning framework

scientific article published on 30 January 2018

PRISMOID: a comprehensive 3D structure database for post-translational modifications and mutations with functional impact

PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sites

scientific article

PROSPERous: high-throughput prediction of substrate cleavage sites for 90 proteases with improved accuracy.

scientific article

Periscope: quantitative prediction of soluble protein expression in the periplasm of Escherichia coli

scientific article published on 2 March 2016

PhosphoPredict: A bioinformatics tool for prediction of human kinase-specific phosphorylation substrates and sites by integrating heterogeneous feature selection

scientific article

Positive-unlabelled learning of glycosylation sites in the human proteome

Preconditioning an Artificial Neural Network Using Naive Bayes

scholarly article by Nayyar A. Zaidi et al published 2016 in Lecture Notes in Computer Science

Prodepth: predict residue depth by support vector regression approach from protein sequences only

scientific article

Proximity Forest: an effective and scalable distance-based classifier for time series

RCPdb: An evolutionary classification and codon usage database for repeat-containing proteins

scientific article published on 13 June 2007

Robust Bayesian Kernel Machine via Stein Variational Gradient Descent for Big Data

Sample-Based Attribute Selective A$n$ DE for Large Data

Scalable Learning of Graphical Models

Scaling Log-Linear Analysis to High-Dimensional Data

Scaling log-linear analysis to datasets with thousands of variables

SecretEPDB: a comprehensive web-based resource for secreted effector proteins of the bacterial types III, IV and VI secretion systems

scientific article published on 23 January 2017

Selective AnDE for large data learning: a low-bias memory constrained approach

article

Self-sufficient itemsets

SimUSF: an efficient and effective similarity measure that is invariant to violations of the interval scale assumption

Skopus: Mining top-k sequential patterns under leverage

Smoothing a rugged protein folding landscape by sequence-based redesign

scientific article published on 26 September 2016

Specious rules: an efficient and effective unifying method for removing misleading and uninformative patterns in association rule mining

Structural Capacitance in Protein Evolution and Human Diseases

Structural Capacitance in Protein Evolution and Human Diseases

scientific article published on 03 July 2018

Structural and dynamic properties that govern the stability of an engineered fibronectin type III domain

scientific article

Subsumption resolution: an efficient and effective technique for semi-naive Bayesian learning

Survey of distance measures for quantifying concept drift and shift in numeric data

Systematic analysis and prediction of type IV secreted effector proteins by machine learning approaches.

scientific article published on 27 November 2017

TANGLE: two-level support vector regression approach for protein backbone torsion angle prediction from primary sequences

scientific article

Techniques for Efficient Learning without Search

Twenty years of bioinformatics research for protease-specific substrate and cleavage site prediction: a comprehensive revisit and benchmarking of existing methods

Ultra-fast meta-parameter optimization for time series similarity measures with application to nearest neighbour classification

scientific article published in 2023

iFeature: a python package and web server for features extraction and selection from protein and peptide sequences.

scientific article published on 8 March 2018

iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites