Headlines

Inter Miami vs. Philadelphia Union 2025 MLS Odds, Time, and Prediction
56 minutes ago
JCB and First Cash Solution Partner to Help Cardmembers Unlock Seamless Payments in Germany
56 minutes ago
Maintains Operational Stability with Full-Year Revenue Surpassing RMB2 Billion in 2024
56 minutes ago
NaaS Teams Up with Xiaomi Auto
56 minutes ago
HER Courage Leaders Summit 2025: Expanding Women’s Leadership Across ASEAN
56 minutes ago
KFC Drops New “Dunk It Bucket” Featuring Mashed Potato Poppers – It’s a Feast Made for Dipping
56 minutes ago

TECHNOLOGY

Deep Learning Is Not So Mysterious or Different

lastnewshub.com12 hours ago02 mins

[Submitted on 3 Mar 2025]

View PDF
HTML (experimental)

Abstract:Deep neural networks are often seen as different from other model classes by defying conventional notions of generalization. Popular examples of anomalous generalization behaviour include benign overfitting, double descent, and the success of overparametrization. We argue that these phenomena are not distinct to neural networks, or particularly mysterious. Moreover, this generalization behaviour can be intuitively understood, and rigorously characterized using long-standing generalization frameworks such as PAC-Bayes and countable hypothesis bounds. We present soft inductive biases as a key unifying principle in explaining these phenomena: rather than restricting the hypothesis space to avoid overfitting, embrace a flexible hypothesis space, with a soft preference for simpler solutions that are consistent with the data. This principle can be encoded in many model classes, and thus deep learning is not as mysterious or different from other model classes as it might seem. However, we also highlight how deep learning is relatively distinct in other ways, such as its ability for representation learning, phenomena such as mode connectivity, and its relative universality.

Submission history

From: Andrew Wilson [view email]
[v1]
Mon, 3 Mar 2025 22:56:04 UTC (1,206 KB)

Leave a Reply Cancel reply

Related News

Show HN: Localscope–Limit scope of Python functions for reproducible execution

lastnewshub.com12 hours ago 0

Darker Than a Dark Pool? Welcome to Wall Street’s ‘Private Rooms’

Darker Than a Dark Pool? Welcome to Wall Street’s ‘Private Rooms’

lastnewshub.com12 hours ago 0

Piramidal (YC W24) is hiring a ML Engineer to decode brainwaves

Piramidal (YC W24) is hiring a ML Engineer to decode brainwaves

lastnewshub.com12 hours ago 0

Chaos in the Cloudflare Lisbon Office

Chaos in the Cloudflare Lisbon Office

lastnewshub.com12 hours ago 0

English