Applied Mathematical Seminar 55: Condensation in deep learning

发布部门：行政办公室(A) 浏览次数：22

Seminar| Institute of Mathematical Sciences

Time: Tuesday, December 19th, 2023 , 13:30-14:30

Location：RS518, IMS

Speaker: Zhiqin Xu, Shanghai JiaoTong University

Abstract: Why do neural network models that look so complex usually generalize well? To understand this problem, we study deep learning training and find that some simple implicit regularization effects. We focus on the condensation phenomenon,which indicates neurons bias towards the same during the training.This talk will discuss the mechanism and potential application of condensation.