Zachary Kenton

Cited by

	All	Since 2019
Citations	2182	2035
h-index	17	15
i10-index	20	17

900

450

225

675

201520162017201820192020202120222023202416 35 26 63 79 142 200 359 888 356

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Amos StorkeyProfessor of Machine Learning and AI, School of Informatics, University of Edinburgh, UKVerified email at ed.ac.uk
Asja FischerProfessor for Machine Learning, Ruhr University BochumVerified email at ini.rub.de
Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARVerified email at umontreal.ca
Nicolas BallasMeta AI ResearchVerified email at meta.com
Stanisław JastrzębskiChief Technology Officer & Chief Scientist @ Molecule.OneVerified email at molecule.one
Devansh ArpitRashi.aiVerified email at rashi.ai
Angelos FilosGoogle DeepMind, University of OxfordVerified email at deepmind.com
Owain EvansResearch Associate, University of OxfordVerified email at philosophy.ox.ac.uk
Andreas StuhlmüllerElicitVerified email at elicit.com
Ryan CareyUniversity of OxfordVerified email at philosophy.ox.ac.uk
Thomas McGrathResearch Scientist, DeepMindVerified email at google.com

Zachary Kenton

Google DeepMind

Verified email at google.com - Homepage

AI Safety Machine Learning Deep Learning Theoretical Physics


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Ethical and social risks of harm from language models L Weidinger, J Mellor, M Rauh, C Griffin, J Uesato, PS Huang, M Cheng, ... arXiv preprint arXiv:2112.04359, 2021	592	2021
Three factors influencing minima in sgd S Jastrzębski, Z Kenton, D Arpit, N Ballas, A Fischer, Y Bengio, A Storkey arXiv preprint arXiv:1711.04623, 2017	460	2017
Taxonomy of risks posed by language models L Weidinger, J Uesato, M Rauh, C Griffin, PS Huang, J Mellor, A Glaese, ... Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022	306	2022
Alignment of language agents Z Kenton, T Everitt, L Weidinger, I Gabriel, V Mikulik, G Irving arXiv preprint arXiv:2103.14659, 2021	111	2021
A systematic comparison of bayesian deep learning robustness in diabetic retinopathy tasks A Filos, S Farquhar, AN Gomez, TGJ Rudner, Z Kenton, L Smith, ... arXiv preprint arXiv:1912.10481, 2019	108	2019
On the relation between the sharpest directions of DNN loss and the SGD step length S Jastrzębski, Z Kenton, N Ballas, A Fischer, Y Bengio, A Storkey arXiv preprint arXiv:1807.05031, 2018	105	2018
Specification gaming: the flip side of AI ingenuity V Krakovna, J Uesato, V Mikulik, M Rahtz, T Everitt, R Kumar, Z Kenton, ... DeepMind Blog 3, 2020	87	2020
Imitating interactive intelligence J Abramson, A Ahuja, I Barr, A Brussee, F Carnevale, M Cassin, ... arXiv preprint arXiv:2012.05672, 2020	72	2020
Goal misgeneralization: why correct specifications aren't enough for correct goals R Shah, V Varma, R Kumar, M Phuong, V Krakovna, J Uesato, Z Kenton arXiv preprint arXiv:2210.01790, 2022	45	2022
The squeezed limit of the bispectrum in multi-field inflation Z Kenton, DJ Mulryne Journal of Cosmology and Astroparticle Physics 2015 (10), 018, 2015	40	2015
D-brane potentials in the warped resolved conifold and natural inflation Z Kenton, S Thomas Journal of High Energy Physics 2015 (2), 1-42, 2015	36	2015
Finding flatter minima with sgd S Jastrzębski, Z Kenton, D Arpit, N Ballas, A Fischer, Y Bengio, A Storkey	33	2018
Width of minima reached by stochastic gradient descent is influenced by learning rate to batch size ratio S Jastrzębski, Z Kenton, D Arpit, N Ballas, A Fischer, Y Bengio, A Storkey Artificial Neural Networks and Machine Learning–ICANN 2018: 27th …, 2018	28	2018
Generalizing from a few environments in safety-critical reinforcement learning Z Kenton, A Filos, Y Gal, O Evans Safe Machine Learning workshop at ICLR, 2019	24*	2019
The separate universe approach to soft limits Z Kenton, DJ Mulryne Journal of Cosmology and Astroparticle Physics 2016 (10), 035, 2016	23	2016
Benchmarking Bayesian deep learning with diabetic retinopathy diagnosis A Filos, S Farquhar, AN Gomez, TGJ Rudner, Z Kenton, L Smith, ... Preprint at https://arxiv. org/abs/1912.10481, 2019	21	2019
Generating the cosmic microwave background power asymmetry with Z Kenton, DJ Mulryne, S Thomas Physical Review D 92 (2), 023505, 2015	20	2015
Explaining grokking through circuit efficiency V Varma, R Shah, Z Kenton, J Kramár, R Kumar arXiv preprint arXiv:2309.02390, 2023	15	2023
Discovering agents Z Kenton, R Kumar, S Farquhar, J Richens, M MacDermott, T Everitt Artificial Intelligence 322, 103963, 2023	15	2023
Predicting Human Deliberative Judgments with Machine Learning O Evans, A Stuhlmüller, C Cundy, R Carey, Z Kenton, T McGrath, ... https://zackenton.github.io/files/predicting_judgments_final.pdf, 2018	15	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors