Stéphane d'Ascoli
École Normale Supérieure
H-index: 14
Europe-France
Top articles of Stéphane d'Ascoli
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Odeformer: Symbolic regression of dynamical systems with transformers | arXiv preprint arXiv:2310.05573 | Stéphane d'Ascoli Sören Becker Alexander Mathis Philippe Schwaller Niki Kilbertus | 2023/10/9 |
Boolformer: Symbolic regression of logic functions with transformers | arXiv preprint arXiv:2309.12207 | Stéphane d'Ascoli Samy Bengio Josh Susskind Emmanuel Abbé | 2023/9/21 |
Length generalization in arithmetic transformers | arXiv preprint arXiv:2306.15400 | Samy Jelassi Stéphane d'Ascoli Carles Domingo-Enrich Yuhuai Wu Yuanzhi Li | 2023/6/27 |
Optimal learning rate schedules in high-dimensional non-convex optimization problems | arXiv preprint arXiv:2202.04509 | Stéphane d'Ascoli Maria Refinetti Giulio Biroli | 2022/2/9 |
Deep symbolic regression for recurrence prediction | Stéphane d’Ascoli Pierre-Alexandre Kamienny Guillaume Lample Francois Charton | 2022/6/28 | |
End-to-end symbolic regression with transformers | Advances in Neural Information Processing Systems | Pierre-Alexandre Kamienny Stéphane d'Ascoli Guillaume Lample François Charton | 2022/4/22 |
On the interplay between data structure and loss function in classification problems | Advances in Neural Information Processing Systems | Stéphane d'Ascoli Marylou Gabrié Levent Sagun Giulio Biroli | 2021/12/6 |
Align, then memorise: the dynamics of learning with feedback alignment | Maria Refinetti Stéphane d’Ascoli Ruben Ohana Sebastian Goldt | 2021/7/1 | |
Convit: Improving vision transformers with soft convolutional inductive biases | Stéphane d’Ascoli Hugo Touvron Matthew L Leavitt Ari S Morcos Giulio Biroli | 2021/7/1 | |
Transformed CNNs: recasting pre-trained convolutional layers with self-attention | arXiv preprint arXiv:2106.05795 | Stéphane d'Ascoli Levent Sagun Giulio Biroli Ari Morcos | 2021/6/10 |
Double trouble in double descent: Bias and variance (s) in the lazy regime | Stéphane d’Ascoli Maria Refinetti Giulio Biroli Florent Krzakala | 2020/11/21 | |
Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems | Stéphane d’Ascoli Alice Coucke Francesco Caltagirone Alexandre Caulier Marc Lelarge | 2020 | |
Comprendre la révolution de l'intelligence artificielle | Stéphane d'Ascoli | 2020/6/11 | |
Scaling description of generalization with number of parameters in deep learning | Journal of Statistical Mechanics: Theory and Experiment | Mario Geiger Arthur Jacot Stefano Spigler Franck Gabriel Levent Sagun | 2020/2/4 |
Triple descent and the two kinds of overfitting: Where & why do they appear? | Advances in Neural Information Processing Systems | Stéphane d'Ascoli Levent Sagun Giulio Biroli | 2020 |