Author: dubai.digital

  • Branch Specialization

    Branch Specialization

    [ad_1] This article is part of the Circuits thread, an experimental format collecting invited short articles and critical commentary delving into the inner workings of neural networks. Visualizing Weights Weight Banding Introduction If we think of interpretability as a kind of “anatomy of neural networks,” most of the circuits thread has involved studying tiny little…

  • An information maximization view on the $\beta$-VAE objective

    An information maximization view on the $\beta$-VAE objective

    [ad_1] guest post with Dóra Jámbor This is a half-guest-post written jointly with Dóra, a fellow participant in a reading group where we recently discussed the original paper on $beta$-VAEs: On the surface of it, $beta$-VAEs are a straightforward extension of VAEs where we are allowed to directly control the tradeoff between the reconstruction and…