

Supplemental Material: Overdispersed Variational Autoencoders

Harshil Shah and David Barber
Department of Computer Science

University College London

Neural network architectures

Below, we describe the architectures of all of the neural networks used for the parameters of the generative and variational distributions. Note that all hidden units use the tanh nonlinearity.

                    Hidden layers    Output nonlinearity
MNIST   π_θ(h)      2 × 200 units    sigmoid
        η_φ(x)      2 × 200 units    linear
        Ω_φ(x)      2 × 200 units    exp
FF      μ_θ(h)      2 × 100 units    sigmoid
        Σ_θ(h)      2 × 100 units    exp
        η_φ(x)      2 × 100 units    linear
        Ω_φ(x)      2 × 100 units    exp

(FF denotes the Frey Faces dataset.)
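For concreteness, the following is a minimal PyTorch sketch of these networks: a two-hidden-layer tanh MLP with a pluggable output nonlinearity. Only the hidden-layer sizes and output nonlinearities come from the table above; the input and latent dimensionalities (x_dim, h_dim) and all names are illustrative assumptions.

```python
import torch
import torch.nn as nn

class TwoLayerMLP(nn.Module):
    """Two tanh hidden layers followed by a configurable output nonlinearity."""

    def __init__(self, in_dim, hidden_dim, out_dim, out_fn):
        super().__init__()
        self.layers = nn.Sequential(
            nn.Linear(in_dim, hidden_dim), nn.Tanh(),
            nn.Linear(hidden_dim, hidden_dim), nn.Tanh(),
            nn.Linear(hidden_dim, out_dim),
        )
        self.out_fn = out_fn

    def forward(self, x):
        return self.out_fn(self.layers(x))

# Hypothetical dimensionalities; the table specifies only the hidden sizes.
x_dim, h_dim = 784, 50

# MNIST networks (2 x 200 hidden units each).
pi_theta  = TwoLayerMLP(h_dim, 200, x_dim, torch.sigmoid)  # Bernoulli means
eta_phi   = TwoLayerMLP(x_dim, 200, h_dim, lambda a: a)    # linear output
omega_phi = TwoLayerMLP(x_dim, 200, h_dim, torch.exp)      # positive via exp
```

The exp output keeps the dispersion parameters Ω strictly positive, while the linear output leaves the parameters η unconstrained.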

Variance of gradient estimates

Below, we show the sample variances of the gradient estimates during the first 1,000 iterations of training, for both datasets.
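The text does not state exactly how these variances were computed. One plausible recipe, sketched below under our own assumptions, is to redraw the stochastic gradient estimate several times at a fixed parameter setting and summarise the per-coordinate sample variance by its mean over coordinates; the function name and the summary statistic are ours.

```python
import torch

def gradient_sample_variance(stochastic_loss, params, n_estimates=100):
    """Sample variance of a stochastic gradient estimator.

    `stochastic_loss` must redraw its noise (e.g. the reparameterisation
    samples) on every call, so that repeated calls yield independent
    gradient estimates at the same parameter values.
    """
    grads = []
    for _ in range(n_estimates):
        loss = stochastic_loss()
        g = torch.autograd.grad(loss, params)
        grads.append(torch.cat([gi.reshape(-1) for gi in g]))
    grads = torch.stack(grads)                     # (n_estimates, n_params)
    return grads.var(dim=0, unbiased=True).mean()  # scalar summary
```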

MNIST

VAE vs. OVAE


IWAE vs. OIWAE

For the MNIST dataset, the OVAE gradient estimates have noticeably lower variance than the VAE estimates. Comparing the OIWAE against the IWAE, the difference is less clear, except in the first 300 training iterations.

Frey Faces

VAE vs. OVAE


IWAE vs. OIWAE

As with MNIST, for the Frey Faces dataset the OVAE gradient estimates have noticeably lower variance than the VAE estimates. However, this is not the case for the OIWAE compared against the IWAE: the two have very similar variances.

Generated output

Below, we show sample outputs generated from the prior and posterior of the learned models, for both datasets.
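For reference, prior samples decode latents drawn from p(h), while posterior samples first encode a data point and then decode a latent drawn from the approximate posterior. The sketch below assumes a standard normal prior and, purely for illustration, a Gaussian reparameterisation of q(h | x); the overdispersed variational family used in the paper is parameterised differently.

```python
import torch

@torch.no_grad()
def sample_from_prior(decoder, h_dim, n=16):
    # Draw latents from the standard normal prior and decode them.
    h = torch.randn(n, h_dim)
    return decoder(h)  # e.g. Bernoulli means for MNIST pixels

@torch.no_grad()
def sample_from_posterior(enc_mean, enc_var, decoder, x):
    # Illustrative Gaussian q(h | x); the paper's overdispersed
    # variational family differs in its details.
    mu, var = enc_mean(x), enc_var(x)
    h = mu + var.sqrt() * torch.randn_like(mu)
    return decoder(h)
```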

MNIST

OVAE: Prior

OVAE: Posterior

OIWAE: Prior

OIWAE: Posterior


Frey Faces

OVAE: Prior

OVAE: Posterior

OIWAE: Prior

OIWAE: Posterior