Search filters

List of works by Yoshua Bengio

A Hierarchical Recurrent Encoder-Decoder for Generative Context-Aware Query Suggestion

A Neural Knowledge Language Model

scientific article published in August 2016

A Neural Network to Detect Homologies in Proteins

scientific article published in January 1990

A Neural Probabilistic Language Model

scholarly article from 2003 from Journal of Machine Learning Research

A Parallel Mixture of SVMs for Very Large Scale Problems

A Recurrent Latent Variable Model for Sequential Data

A Two-Stream Continual Learning System With Variational Domain-Agnostic Feature Replay

scientific article published in 2022

A deep learning framework for neuroscience

scientific article published on 28 October 2019

A hybrid pareto mixture for conditional asymmetric fat-tailed distributions

scientific article published on 26 May 2009

A neural probabilistic language model

scientific article from NIPS 2000

A parallel mixture of SVMs for very large scale problems

scientific article published in May 2002

A semantic matching energy function for learning with multi-relational data

scientific article published on 30 May 2013

Adaptive importance sampling to accelerate training of a neural probabilistic language model

scientific article published in April 2008

Advances in optimizing recurrent networks

Algorithms for Hyper-Parameter Optimization

scientific article published in January 2011

Alternative time representation in dopamine models

scientific article published on 22 October 2009

An Infinite Factor Model Hierarchy Via a Noisy-Or Mechanism

scientific article published in January 2009

An Input Output HMM Architecture

scientific article published in January 1995

An empirical evaluation of deep architectures on problems with many factors of variation

scholarly article published 2007

Architectural Complexity Measures of Recurrent Neural Networks

scientific article published in January 2016

Attention-Based Models for Speech Recognition

scholarly article by Jan K. Chorowski et al published 2015 in Advances in Neural Information Processing Systems 28

Augmented Functional Time Series Representation and Forecasting with Gaussian Processes

Bayesian Model-Agnostic Meta-Learning

Bias learning, knowledge sharing

scientific article published in January 2003

BigBrain 3D atlas of cortical layers: Cortical and laminar thickness gradients diverge in sensory and motor cortices

scientific article published on 03 April 2020

Binarized Neural Networks

scientific article published in January 2016

BinaryConnect: Training Deep Neural Networks with binary weights during propagations

Blocks and Fuel: Frameworks for deep learning

Boosting neural networks.

scientific article

Brain Inspired Reinforcement Learning

scientific article published in January 2005

Brain tumor segmentation with Deep Neural Networks

scientific article published on 19 May 2016

CAMAP: Artificial neural networks unveil the role of codon arrangement in modulating MHC-I peptides presentation

scientific article published on 22 October 2021

Catalyzing next-generation Artificial Intelligence through NeuroAI

scientific article published on 22 March 2023

Challenges in Representation Learning: A Report on Three Machine Learning Contests

scholarly article by Ian J. Goodfellow et al published 2013 in Lecture Notes in Computer Science

Challenges in representation learning: a report on three machine learning contests

scientific article published on 29 December 2014

Char2Wav: End-to-End Speech Synthesis

scientific article (publication date: 2017)

Classification using discriminative restricted Boltzmann machines

Collaborative filtering on a family of biological targets.

scientific article

Computing Power and the Governance of Artificial Intelligence

Conditioning and time representation in long short-term memory networks

scientific article published on 21 November 2013

Contextual tag inference

Convergence Properties of the K-Means Algorithms

scientific article published in January 1995

Convex Neural Networks

scientific article published in January 2006

Cost functions and model combination for VaR-based asset allocation using neural networks

scientific article published on 01 January 2001

Credit Assignment through Time: Alternatives to Backpropagation

scientific article published in January 1994

Curriculum learning

scientific article published on 2009

Deep Generative Stochastic Networks Trainable by Backprop

scholarly article

Deep Learning

book edition from 2016 by Goodfellow, Bengio and Courville

Deep convolutional networks for quality assessment of protein folds

scientific article published on 01 December 2018

Deep learning

Nature article from 2015 by LeCun, Bengio and Hinton

Deep learning for AI

journal article from 'Communications of the ACM' published in 2021

Dendritic cortical microcircuits approximate the backpropagation algorithm

Depth with nonlinearity creates no bad local minima in ResNets

scientific article published on 01 July 2019

Describing Multimedia Content Using Attention-Based Encoder-Decoder Networks

article by Kyunghyun Cho et al published November 2015 in IEEE Transactions on Multimedia

Dialogues interdisciplinaires : les risques majeurs de l'IA générative

Diffusion of Credit in Markovian Models

scientific article published in January 1995

Drawing and Recognizing Chinese Characters with Recurrent Neural Network

scientific article published on 18 April 2017

Dynamic Neural Turing Machine with Continuous and Discrete Addressing Schemes

scientific article published on 30 January 2018

Editorial introduction to the Neural Networks special issue on Deep Learning of Representations

scientific article published on 15 December 2014

Efficient Non-Parametric Function Induction in Semi-Supervised Learning

scientific article (publication date: 2005)

Efficient recognition of immunoglobulin domains from amino acid sequences using a neural network

scientific article published in October 1990

Equilibrated adaptive learning rates for non-convex optimization

academic article

Equilibrium Propagation: Bridging the Gap between Energy-Based Models and Backpropagation

scientific article

Equivalence of Equilibrium Propagation and Recurrent Backpropagation

scientific article published on 21 December 2018

Estimating Car Insurance Premia: a Case Study in High-Dimensional Data Inference

article by Nicolas Chapados et al published 2002 in Advances in Neural Information Processing Systems 14

Experiments on the application of IOHMMs to model financial returns series

scientific article published on 01 January 2001

Extracting and composing robust features with denoising autoencoders

Factorized embeddings learns rich and biologically meaningful embedding spaces using factorized tensor decomposition

scientific article published on 01 July 2020

GFlowNets for AI-driven scientific discovery

scientific article published in January 2023

Gated Orthogonal Recurrent Units: On Learning to Forget

scientific article published on 14 February 2019

Generalization in Deep Learning

scientific article

Generalized Denoising Auto-Encoders as Generative Models

Generating Multiscale Amorphous Molecular Structures Using Deep Learning: A Study in 2D

scientific article published on 24 September 2020

Generative Adversarial Nets

paper introducing GANs

GibbsNet: Iterative Adversarial Inference for Deep Graphical Models

scientific article published in January 2017

Global optimization of a neural network-hidden Markov model hybrid

scientific article published in January 1992

Globally Trained Handwritten Word Recognizer using Spatial Representation, Convolutional Neural Networks, and Hidden Markov Models

scientific article published in January 1994

Gradient based sample selection for online continual learning

Gradient-based learning applied to document recognition

scholarly article

Gradient-based optimization of hyperparameters.

scientific article

Graph Attention Networks

scholarly article

Greedy Layer-Wise Training of Deep Networks

scientific article published in January 2007

Hierarchical Recurrent Neural Networks for Long-Term Dependencies

scientific article published in January 1996

HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering

How does hemispheric specialization contribute to human-defining cognition?

journal article from 'Neuron' published in 2021

How to Initialize your Network? Robust Initialization for WeightNorm & ResNets

How transferable are features in deep neural networks?

scientific article published in January 2014

Hybrid Models for Learning to Branch

Identifying and attacking the saddle point problem in high-dimensional non-convex optimization

scientific article published in January 2014

Image-to-image translation for cross-domain disentanglement

Incorporating Second-Order Functional Knowledge for Better Option Pricing

Inference for the Generalization Error

Inherent privacy limitations of decentralized contact tracing apps

scientific article published on 25 June 2020

Input-output HMMs for sequence processing

scientific article published in January 1996

Interdisciplinary Dialogues: The Major Risks of Generative AI

Iterative Neural Autoregressive Distribution Estimator NADE-k

scientific article published in January 2014

Justifying and generalizing contrastive divergence

scientific article published in June 2009

K-Local Hyperplane and Convex Distance Nearest Neighbor Algorithms

scholarly article by Pascal Vincent & Yoshua Bengio published 2002 in Advances in Neural Information Processing Systems 14

Kernel Matching Pursuit

LeRec: A NN/HMM Hybrid for On-Line Handwriting Recognition

scientific article published on November 1, 1995

Learning Deep Architectures for AI

scientific article (publication date: 2009)

Learning Fixed Points in Generative Adversarial Networks: From Image-to-Image Translation to Disease Detection and Localization

scientific article published on 01 November 2019

Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning

scientific article published on 30 March 2018

Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation

scholarly article

Learning deep physiological models of affect

scholarly article by Hector P. Martinez et al published May 2013 in IEEE Computational Intelligence Magazine

Learning eigenfunctions links spectral embedding and kernel PCA.

scientific article published in October 2004

Learning long-term dependencies with gradient descent is difficult

scientific article published in January 1994

Learning normalized inputs for iterative estimation in medical image segmentation

scientific article published on 14 November 2017

Learning structured embeddings of knowledge bases

article

Learning the 2-D Topology of Images

scientific article published in January 2008

Learning to Understand Phrases by Embedding the Dictionary

scientific article published in 2016

Locally linear embedding for dimensionality reduction in QSAR.

scientific article published in July 2004

Machine learning for combinatorial optimization: A methodological tour d’horizon

journal article from 'European Journal of Operational Research' published in 2021

Machines Who Learn

scientific article

Managing extreme AI risks amid rapid progress

scientific article published on 24 May 2024

Manifold Parzen Windows

scientific article published in January 2003

MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis

scientific article published in January 2019

MetaGAN: An Adversarial Approach to Few-Shot Learning

Modeling High-Dimensional Discrete Data with Multi-Layer Neural Networks

scholarly article by Yoshua Bengio & Samy Bengio published 2000 in Advances in Neural Information Processing Systems 12

Multi-Prediction Deep Boltzmann Machines

Multi-Task Learning for Stock Selection

NICE: Non-linear Independent Components Estimation

journal article from '3rd International Conference on Learning Representations, ICLR 2015 - Workshop Track Proceedings' published in 2014

Neural Machine Translation by Jointly Learning to Align and Translate

scholarly article

Neural Network - Gaussian Mixture Hybrid for Speech Recognition or Density Estimation

scientific article published in January 1992

Neural Probabilistic Language Models

No Unbiased Estimator of the Variance of K-Fold Cross-Validation

scientific article published in January 2004

Non-Local Manifold Parzen Windows

scientific article published in January 2006

Non-Local Manifold Tangent Learning

scientific article published in January 2005

Non-normal Recurrent Neural Network (nnRNN): learning long time dependencies while improving expressivity with transient dynamics

scholarly article by Giancarlo Kerg et al published 2019 in Advances in Neural Information Processing Systems 32

Nonlocal estimation of manifold structure

scientific article published on October 2006

On Adversarial Mixup Resynthesis

On Multiplicative Integration with Recurrent Neural Networks

scientific article published in January 2016

On Tracking The Partition Function

scientific article published in January 2011

On the Number of Linear Regions of Deep Neural Networks

scientific article published in January 2014

On the Properties of Neural Machine Translation: Encoder-Decoder Approaches

On the challenge of learning complex functions

scientific article published on January 2007

Out-of-Sample Extensions for LLE, Isomap, MDS, Eigenmaps, and Spectral Clustering

scientific article published in January 2004

Plan, Attend, Generate: Planning for Sequence-to-Sequence Models

scientific article published in January 2017

Predicting COVID-19 Pneumonia Severity on Chest X-ray With Deep Learning

scientific article published on 28 July 2020

Professor Forcing: A New Algorithm for Training Recurrent Networks

scientific article published in January 2016

Quick Training of Probabilistic Neural Nets by Importance Sampling

scientific article published in 2003

Recurrent Neural Networks for Missing or Asynchronous Data

scientific article published in January 1996

Regulating advanced artificial agents

scientific article published on 04 April 2024

Representation Learning: A Review and New Perspectives

review article by Bengio, Courville and Vincent on arXiv

Representation learning: a review and new perspectives

scientific article published on August 2013 in IEEE PAMI

Representational power of restricted boltzmann machines and deep belief networks

scientific article published in June 2008

Robust regression with asymmetric heavy-tail noise distributions

scientific article published in October 2002

STDP-Compatible Approximation of Backpropagation in an Energy-Based Model

scientific article published on 17 January 2017

Scaling Equilibrium Propagation to Deep ConvNets by Drastically Reducing Its Gradient Estimator Bias

scientific article published on 18 February 2021

Scaling up spike-and-slab models for unsupervised feature learning

scientific article published in August 2013

Scientific discovery in the age of artificial intelligence

scientific article published on 2 August 2023

Selective small molecule peptidomimetic ligands of TrkC and TrkA receptors afford discrete or complete neurotrophic activities

scientific article

Semi-supervised Learning by Entropy Minimization

scientific article published in January 2005

Shallow vs. Deep Sum-Product Networks

scientific article published in January 2011

Shared Context Probabilistic Transducers

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

scholarly article

Slow, Decorrelated Features for Pretraining Complex Cell-like Networks

scientific article published in January 2009

Sparse Attentive Backtracking: Temporal Credit Assignment Through Reminding

Speaker Independent Speech Recognition with Neural Networks and Speech Knowledge

scientific article published in January 1990

Stochastic Ratio Matching of RBMs for Sparse High-Dimensional Inputs

Suitability of V1 Energy Models for Object Classification

scientific article published on December 16, 2010

Tackling Climate Change with Machine Learning

scientific article published on 5 November 2019

Tackling Climate Change with Machine Learning

scientific article published on 08 February 2022

Taking on the curse of dimensionality in joint distributions using neural networks

scientific article published on 01 January 2000

The Consciousness Prior

academic journal article

The Curse of Highly Variable Functions for Local Kernel Machines

scientific article published in January 2006

The Manifold Tangent Classifier

scientific article published in January 2011

The Spike-and-Slab RBM and Extensions to Discrete and Sparse Data Distributions

scientific article

The need for privacy with public digital contact tracing during the COVID-19 pandemic

scientific article published on 02 June 2020

Theano: A Python framework for fast computation of mathematical expressions

journal article

Theano: a CPU and GPU math compiler in Python

scientific article published on 2010

Topmoumoute Online Natural Gradient Algorithm

scientific article published in January 2008

Toward Causal Representation Learning

journal article from 'Proceedings of the IEEE' published in 2021

Toward Training Recurrent Neural Networks for Lifelong Learning

scientific article published on 08 November 2019

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims

scientific article published on 15 April 2020

Towards Biologically Plausible Deep Learning

scientific article published on 9 August 2016

Towards equilibrium molecular conformation generation with GFlowNets

scientific article published in January 2024

Tractable multivariate binary density estimation and the restricted Boltzmann forest

scientific article published in September 2010

Training Methods for Adaptive Boosting of Neural Networks

scholarly article by Holger Schwenk & Yoshua Bengio published 1998 in Advances in Neural Information Processing Systems 10

Unsupervised State Representation Learning in Atari

Untangling tradeoffs between recurrence and self-attention in artificial neural networks

scientific article published in November 2020

Updates of Equilibrium Prop Match Gradients of Backprop Through Time in an RNN with Static Input

Use machine learning to find energy materials.

scientific article published in December 2017

Use of Multi-Layered Networks for Coding Speech with Phonetic Features

scientific article published in January 1989

Using Recurrent Neural Networks for Slot Filling in Spoken Language Understanding

scholarly article by Gregoire Mesnil et al published March 2015 in IEEE/ACM transactions on audio, speech, and language processing

Using a Financial Training Criterion Rather than a Prediction Criterion

scientific article published on August 1, 1997

Variational Temporal Abstraction

Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net

scientific article published in January 2017

Visualizing the Consequences of Climate Change Using Cycle-Consistent Adversarial Networks

scientific article published on 2 May 2019

Wasserstein Dependency Measure for Representation Learning

scientific article published in January 2019

Word Representations: A Simple and General Method for Semi-Supervised Learning

scientific article published in July 2010

Word-level training of a handwritten word recognizer based on convolutional neural networks

Your GAN is Secretly an Energy-based Model and You Should Use Discriminator Driven Latent Sampling

scientific article published in November 2020

Z-Forcing: Training Stochastic Recurrent Networks

scientific article published in January 2017