Follow
Stanisław Jastrzębski
Stanisław Jastrzębski
Chief Technology Officer & Chief Scientist @ Molecule.One
Verified email at molecule.one - Homepage
Title
Cited by
Cited by
Year
Parameter-Efficient Transfer Learning for NLP
N Houlsby, A Giurgiu*, S Jastrzebski*, B Morrone, Q Laroussilhe, ...
International Conference on Machine Learning (ICML) 2019, 2019
37242019
A Closer Look at Memorization in Deep Networks
D Arpit*, S Jastrzebski*, N Ballas*, D Krueger*, E Bengio, MS Kanwal, ...
International Conference on Machine Learning 2017, 2017
20652017
Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening
N Wu, J Phang, J Park, Y Shen, Z Huang, M Zorin, S Jastrzebski, T Févry, ...
IEEE Trans Med Imaging, 2019
6562019
Three factors influencing minima in SGD
S Jastrzebski*, Z Kenton*, D Arpit, N Ballas, A Fischer, Y Bengio, ...
International Conference on Artificial Neural Networks 2018; International …, 2017
5472017
Molecule Attention Transformer
Ł Maziarka, T Danel, S Mucha, K Rataj, J Tabor, S Jastrzębski
NeurIPS 2019 workshop; arXiv preprint arXiv:2002.08264, 2020
210*2020
The Break-Even Point on Optimization Trajectories of Deep Neural Networks
S Jastrzebski, M Szymczak, S Fort, D Arpit, J Tabor, K Cho, K Geras
International Conference on Learning Algorithms (ICLR) 2020, 2020
1652020
Residual connections encourage iterative inference
S Jastrzebski*, D Arpit*, N Ballas, V Verma, T Che, Y Bengio
International Conference on Learning Algorithms (ICLR) 2018, 2017
1492017
An artificial intelligence system for predicting the deterioration of COVID-19 patients in the emergency department
FE Shamout, Y Shen, N Wu, A Kaku, J Park, T Makino, S Jastrzębski, ...
NPJ digital medicine 4 (1), 80, 2021
1372021
Molecule Edit Graph Attention Network: Modeling Chemical Reactions as Sequences of Graph Edits
M Sacha, P Błaż, Mikołaj, Byrski, P Włodarczyk-Pruszyński, S Jastrzębski
Journal of Cheminformatics and Modeling (JCIM), 2020
1352020
On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length
S Jastrzębski, Z Kenton, N Ballas, A Fischer, Y Bengio, A Storkey
International Conference on Learning Algorithms (ICLR) 2019, 2019
1282019
Learning to SMILE(S)
S Jastrzebski, D Lesniak, WM Czarnecki
International Conference on Learning Representation 2016 (Workshop track), 2016
112*2016
Learning to Compute Word Embeddings on the Fly
D Bahdanau, T Bosc*, S Jastrzebski*, E Grefenstette, P Vincent, Y Bengio
Montreal AI Symposium 2017, 2017
1002017
Large Scale Structure of Neural Network Loss Landscapes
S Fort, S Jastrzebski
NeurIPS 2019, 2019
902019
Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks
N Wu, S Jastrzębski, K Cho, KJ Geras
ICML 2022, arXiv preprint arXiv:2202.05306, 2022
852022
Stiffness: A new perspective on generalization in neural networks
S Fort, PK Nowak, S Jastrzebski, S Narayanan
arXiv preprint arXiv:1901.09491, 2019
852019
How to evaluate word embeddings? On importance of data efficiency and simple supervised tasks
S Jastrzebski, D Leśniak, WM Czarnecki
arXiv preprint arXiv:1702.02170, 2017
792017
Evolutionary-Neural Hybrid Agents for Architecture Search
K Maziarz, A Khorlin, Q de Laroussilhe, S Jastrzebski, T Mingxing, ...
arXiv preprint arXiv:1811.09828, 2018
77*2018
Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization
S Jastrzebski, D Arpit, O Astrand, G Kerg, H Wang, C Xiong, R Socher, ...
International Conference on Machine Learning (ICML) 2021, 2020
632020
Cramer-Wold Auto-Encoder
S Knop, P Spurek, J Tabor, I Podolak, M Mazur, S Jastrzębski
Journal of Machine Learning Research 21 (164), 1-28, 2020
59*2020
RetroGNN: Fast Estimation of Synthesizability for Virtual Screening and De Novo Design by Learning from Slow Retrosynthesis Software
CH Liu, M Korablyov, S Jastrzębski, P Włodarczyk-Pruszyński, Y Bengio, ...
Journal of Chemical Information and Modeling 62 (10), 2293-2300, 2022
58*2022
The system can't perform the operation now. Try again later.
Articles 1–20