Sepp Hochreiter

research

∙ 07/10/2023

SITTA: A Semantic Image-Text Alignment for Image Captioning

Textual and semantic comprehension of images is essential for generating...

0 Fabian Paischer, et al. ∙

research

∙ 07/06/2023

Quantification of Uncertainty with Adversarial Models

Quantifying uncertainty is important for actionable predictions in real-...

0 Kajetan Schweighofer, et al. ∙

research

∙ 06/26/2023

Learning to Modulate pre-trained Models in RL

Reinforcement Learning (RL) has been successful in various domains like ...

0 Thomas Schmied, et al. ∙

research

∙ 06/15/2023

Semantic HELM: An Interpretable Memory for Reinforcement Learning

Reinforcement learning agents deployed in the real world often have to c...

0 Fabian Paischer, et al. ∙

research

∙ 05/02/2023

Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation

We study the problem of choosing algorithm hyper-parameters in unsupervi...

7 Marius-Constantin Dinu, et al. ∙

research

∙ 04/20/2023

Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget

Masked Image Modeling (MIM) methods, like Masked Autoencoders (MAE), eff...

0 Johannes Lehner, et al. ∙

research

∙ 03/22/2023

Conformal Prediction for Time Series with Modern Hopfield Networks

To quantify uncertainty, conformal prediction methods are gaining contin...

0 Andreas Auer, et al. ∙

research

∙ 03/14/2023

Traffic4cast at NeurIPS 2022 – Predict Dynamics along Graph Edges from Sparse Node Data: Whole City Traffic and ETA from Stationary Vehicle Detectors

The global trends of urbanization and increased personal mobility force ...

0 Moritz Neun, et al. ∙

research

∙ 03/06/2023

Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language

Activity and property prediction models are the central workhorses in dr...

0 Philipp Seidl, et al. ∙

research

∙ 02/17/2023

G-Signatures: Global Graph Propagation With Randomized Signatures

Graph neural networks (GNNs) have evolved into one of the most popular d...

0 Bernhard Schäfl, et al. ∙

research

∙ 08/08/2022

Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks

The synthesis of high-resolution remote sensing images based on text des...

3 Yonghao Xu, et al. ∙

research

∙ 07/12/2022

Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning

In lifelong learning, an agent learns throughout its entire life without...

10 Christian Alexander Steinparz, et al. ∙

research

∙ 06/07/2022

Few-Shot Learning by Dimensionality Reduction in Gradient Space

We introduce SubGD, a novel few-shot learning method which is based on t...

5 Martin Gauch, et al. ∙

research

∙ 06/02/2022

Entangled Residual Mappings

Residual mappings have been shown to perform representation learning in ...

6 Mathias Lechner, et al. ∙

research

∙ 06/01/2022

Hopular: Modern Hopfield Networks for Tabular Data

While Deep Learning excels in structured data as encountered in vision a...

29 Bernhard Schäfl, et al. ∙

research

∙ 05/24/2022

History Compression via Language Models in Reinforcement Learning

In a partially observable Markov decision process (POMDP), an agent typi...

5 Fabian Paischer, et al. ∙

research

∙ 03/31/2022

Traffic4cast at NeurIPS 2021 – Temporal and Spatial Few-Shot Transfer Learning in Gridded Geo-Spatial Processes

The IARAI Traffic4cast competitions at NeurIPS 2019 and 2020 showed that...

4 Christian Eichenberger, et al. ∙

research

∙ 11/08/2021

Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning

In real world, affecting the environment by a weak policy can be expensi...

24 Kajetan Schweighofer, et al. ∙

research

∙ 10/21/2021

CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP

Contrastive learning with the InfoNCE objective is exceptionally success...

11 Andreas Fürst, et al. ∙

research

∙ 06/21/2021

Boundary Graph Neural Networks for 3D Simulations

The abundance of data has given machine learning huge momentum in natura...

11 Andreas Mayr, et al. ∙

research

∙ 05/04/2021

Learning 3D Granular Flow Simulations

Recently, the application of machine learning models has gained momentum...

27 Andreas Mayr, et al. ∙

research

∙ 04/07/2021

Modern Hopfield Networks for Few- and Zero-Shot Reaction Prediction

An essential step in the discovery of new drugs and materials is the syn...

7 Philipp Seidl, et al. ∙

research

∙ 03/31/2021

Trusted Artificial Intelligence: Towards Certification of Machine Learning Applications

Artificial Intelligence is one of the fastest growing technologies of th...

44 Philip Matthias Winter, et al. ∙

research

∙ 01/13/2021

MC-LSTM: Mass-Conserving LSTM

The success of Convolutional Neural Networks (CNNs) in computer vision i...

13 Pieter-Jan Hoedt, et al. ∙

research

∙ 12/02/2020

Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER

We prove under commonly used assumptions the convergence of actor-critic...

4 Markus Holzleitner, et al. ∙

research

∙ 10/15/2020

Rainfall-Runoff Prediction at Multiple Timescales with a Single Long Short-Term Memory Network

Long Short-Term Memory Networks (LSTMs) have been applied to daily disch...

0 Martin Gauch, et al. ∙

research

∙ 10/13/2020

Cross-Domain Few-Shot Learning by Representation Fusion

In order to quickly adapt to new data, few-shot learning aims at learnin...

0 Thomas Adler, et al. ∙

research

∙ 09/29/2020

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution

Reinforcement Learning algorithms require a large number of samples to s...

16 Vihang P. Patil, et al. ∙

research

∙ 07/16/2020

Hopfield Networks is All You Need

We show that the transformer attention mechanism is the update rule of a...

0 Hubert Ramsauer, et al. ∙

research

∙ 03/25/2020

Large-scale ligand-based virtual screening for SARS-CoV-2 inhibitors using deep neural networks

Due to the current severe acute respiratory syndrome coronavirus 2 (SARS...

0 Markus Hofmarcher, et al. ∙

research

∙ 11/14/2019

Detecting cutaneous basal cell carcinomas in ultra-high resolution and weakly labelled histopathological images

Diagnosing basal cell carcinomas (BCC), one of the most common cutaneous...

25 Susanne Kimeswenger, et al. ∙

research

∙ 11/10/2019

Using LSTMs for climate change assessment studies on droughts and floods

Climate change affects occurrences of floods and droughts worldwide. How...

28 Frederik Kratzert, et al. ∙

research

∙ 10/30/2019

Quantum Optical Experiments Modeled by Long Short-Term Memory

We demonstrate how machine learning is able to model experiments in quan...

40 Thomas Adler, et al. ∙

research

∙ 10/09/2019

Patch Refinement – Localized 3D Object Detection

We introduce Patch Refinement a two-stage model for accurate 3D object d...

5 Johannes Lehner, et al. ∙

research

∙ 09/25/2019

Explaining and Interpreting LSTMs

While neural networks have acted as a strong unifying force in the desig...

17 Leila Arras, et al. ∙

research

∙ 07/19/2019

Benchmarking a Catchment-Aware Long Short-Term Memory Network (LSTM) for Large-Scale Hydrological Modeling

Regional rainfall-runoff modeling is an old but still mostly out-standin...

0 Frederik Kratzert, et al. ∙

research

∙ 03/19/2019

NeuralHydrology - Interpreting LSTMs in Hydrology

Despite the huge success of Long Short-Term Memory networks, their appli...

14 Frederik Kratzert, et al. ∙

research

∙ 03/07/2019

Interpretable Deep Learning in Drug Discovery

Without any means of interpretation, neural networks that predict molecu...

20 Kristina Preuer, et al. ∙

research

∙ 06/20/2018

RUDDER: Return Decomposition for Delayed Rewards

We propose a novel reinforcement learning approach for finite Markov dec...

2 Jose A. Arjona-Medina, et al. ∙

research

∙ 03/26/2018

Fréchet ChemblNet Distance: A metric for generative models for molecules

The new wave of successful generative models in machine learning has inc...

0 Kristina Preuer, et al. ∙

research

∙ 02/13/2018

First Order Generative Adversarial Networks

GANs excel at learning high dimensional distributions, but they can upda...

0 Calvin Seward, et al. ∙

research

∙ 08/29/2017

Coulomb GANs: Provably Optimal Nash Equilibria via Potential Fields

Generative adversarial networks (GANs) evolved into one of the most succ...

0 Thomas Unterthiner, et al. ∙

research

∙ 06/26/2017

GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium

Generative Adversarial Networks (GANs) excel at creating realistic image...

0 Martin Heusel, et al. ∙

research

∙ 06/08/2017

Self-Normalizing Neural Networks

Deep Learning has revolutionized vision via convolutional neural network...

0 Günter Klambauer, et al. ∙

research

∙ 11/23/2015

Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)

We introduce the "exponential linear unit" (ELU) which speeds up learnin...

0 Djork-Arné Clevert, et al. ∙

research

∙ 03/04/2015

Toxicity Prediction using Deep Learning

Everyday we are exposed to various chemicals via food additives, cleanin...

0 Thomas Unterthiner, et al. ∙

research

∙ 02/23/2015

Rectified Factor Networks

We propose rectified factor networks (RFNs) to efficiently construct ver...

0 Djork-Arné Clevert, et al. ∙

Sepp Hochreiter

Featured Co-authors

Sign in with Google

Consider DeepAI Pro