PostDoctoral Researcher
enricoventura.pm@gmail.com
I'm a researcher at the Theoretical and Scientific Data Science unit (TSDS) in SISSA, Trieste.
My research investigates the Statistical Mechanics of Disordered Systems and its multidisciplinary applications.
I am mainly interested in the emergent properties of learning systems, such as memorization and generalization in neural networks and generative models.
Diffusion models represent the state of the art in image and video generation. These models learn the distribution of the data through a diffusive trajectory in the data-space, the same type of dynamical processes studied by stochastic thermodynamics. Statistical mechanics has recently helped at providing for crucial insights about the complex dynamics with which such models learn the data distribution. We are now studying the learning dynamics in case of stractured training data, i.e. data-points that live on a low dimensional manifold. We are exploiting Random Matrix Theory and the physics of Random Energy Models to answer the following questions:
Classifier-free Guidance (CFG) is a simple yet effective technique that helps diffusion models follow a user’s prompt. By combining standard unconditional diffusion with diffusion conditioned on a specific class of the data, it steers generation toward samples (e.g. images,videos or text) that more clearly reflect the intended content. We study the sampling dynamics of a diffusion model under CFG based on the statistical mechanics of disordered systems. Specifically, we study the time-dependent transformation of the diffusion potential providing a quantitative prediction of the way a complex target distribution is deformed to improve data generation. Moreover, we leverage our results to propose alternative theory-based guidance schedules that enhance such beneficial effects.
The human brain is a complex system that is fascinating scientists since a long time. Its remarkable capabilities include classification of concepts, retrieval of memories and creative generation of new examples. At the same time, modern artificial neural networks are trained on large amounts of data to accomplish the same type of tasks with a considerable degree of precision. By contrast with biological systems, machines appear to be either significantly slow and energetically expensive to train, suggesting the need for a paradigmatic change in statistical learning. We evaluate a known training procedure for recurrent neural networks that can be split into a prior Hebbian learning phase (Learning) and a subsequent anti-Hebbian one (Unlearning). We are progressively proving that this unsupervised prescription is capable of performing classification, memorization and generation of examples with a high degree of efficacy while aligning with some modern biological theories of learning.
(*) first co-author.
I am interested in analysing and divulgating issues relative to gender discrimination and inequalities in academia, especially in the STEM environment.
Please reach out to me if you want to collaborate !
Here is a series of posters assembled with the Gender Balance Working Group of La Sapienza Physics Department, obtained from comic strips by Did This Really Happen?! and displayed at La Sapienza in Autumn 2023.