
BLiNQ MEDIA
Praneeth Vepakomma
Senior Data Scientist

Generalization in Supervised Machine Learning

Hypothetical Knapsack of Coins:

Copper and Gold Coins. Total number of coins is fixed and is a large sample.
Capture-Recapture: What is the proportion of Gold coins?

Copper and Gold Coins. Total number of coins is variable and is a large sample.
Capture-Recapture: What is the proportion of Gold coins?
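A minimal simulation sketch of the knapsack example (not from the original slides; the knapsack size, true gold proportion, and sample size are assumed values):

```python
# Sketch: estimate the proportion of gold coins in a large, fixed knapsack
# from a random sample, as in the capture-recapture analogy.
# Knapsack size, true proportion, and sample size are assumed values.
import random

random.seed(0)
N = 100_000                                   # fixed, large number of coins
true_p = 0.3                                  # assumed true proportion of gold
knapsack = ["gold"] * int(true_p * N) + ["copper"] * (N - int(true_p * N))

sample = random.sample(knapsack, 1_000)       # capture a sample without replacement
p_hat = sum(coin == "gold" for coin in sample) / len(sample)
print(f"estimated proportion of gold: {p_hat:.3f} (true: {true_p})")
```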

BASIC ML/STAT TERMINOLOGY:

190 years after Gauss, the core problem of prediction remains an active one:

Then:

Now:


Find a mapping# from the features.

# Approximation

The mapping is specified by a list of parameters required to represent the function.

Existing Features with Known Labels (the seen data)

Unavailable Features with Unknown Labels (the unseen data to be predicted)

Loss Function

Assumptions
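One way to write the setup formally (the notation here is assumed, not taken from the slides): a parameterized mapping from features to labels, scored by a loss.

```latex
% Assumed notation: f_\theta maps features to predicted labels,
% \theta is the list of parameters representing the function,
% and the loss compares a prediction with a known label.
\[
f_\theta : \mathcal{X} \to \mathcal{Y}, \qquad
\theta = (\theta_1, \dots, \theta_p), \qquad
\ell\bigl(f_\theta(x),\, y\bigr) \;\; \text{e.g. squared loss } \bigl(f_\theta(x) - y\bigr)^2
\]
```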

What is Supervised Learning?

Evaluating the Learned Function:

The Loss Function quantifies the error in the approximation.

Learn a mapping by optimizing the loss.

Example:

Predictions with varying parameters:
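A sketch of the kind of demonstration these slides suggest (the data and models here are assumed): fit polynomials of increasing degree to noisy data and watch the training loss shrink while the predictions change dramatically.

```python
# Sketch (assumed data/model): predictions with varying parameters.
# Higher-degree polynomials drive training loss down but give very
# different predictions at a new point.
import numpy as np

rng = np.random.default_rng(0)
x = np.sort(rng.uniform(0, 1, 20))
y = np.sin(2 * np.pi * x) + rng.normal(0, 0.2, size=x.shape)   # noisy target

for degree in (1, 3, 9):
    coeffs = np.polyfit(x, y, degree)           # least-squares fit of given degree
    train_mse = np.mean((np.polyval(coeffs, x) - y) ** 2)
    pred = np.polyval(coeffs, 0.5)
    print(f"degree={degree}: train MSE={train_mse:.4f}, prediction at x=0.5: {pred:.3f}")
```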

How do we generalize?

Generalization and Predictability

Empirical Risk Minimization:

True Risk Minimization:

Empirical Risk is the average loss over the seen (training) data.

True Risk is the expected loss under the process generating the (X, Y) pairs.
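In symbols (notation assumed), the two quantities the slides contrast:

```latex
% Empirical risk: average loss over the n seen (training) pairs.
% True risk: expected loss under the distribution P generating (X, Y).
\[
R_{\mathrm{emp}}(\theta) = \frac{1}{n} \sum_{i=1}^{n} \ell\bigl(f_\theta(x_i),\, y_i\bigr),
\qquad
R(\theta) = \mathbb{E}_{(X,Y) \sim P}\bigl[\ell\bigl(f_\theta(X),\, Y\bigr)\bigr]
\]
```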

PARAMETRIC CHARACTERIZATION OF THE MAPPING:

2D-Linear function: Slope, Intercept
Cubic Spline: Number of knots, Location of knots
Nearest-Neighbor regression: Number of neighbors
Lasso: L1-L2 Weights
Support Vector Machines: Kernel width, Margin length
Random Forests: Resampling sample size

Long list of available Supervised Learning Techniques.

Most of the techniques have tuning parameters.

We can improve out-of-sample performance by tuning the technique with well-chosen parameters.

Tuning can be performed by cross-validation over a discrete grid of parameter combinations.
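A minimal sketch of grid tuning by cross-validation, using scikit-learn's GridSearchCV on a nearest-neighbor regressor (the estimator, grid, and data are assumed; the slides do not prescribe a specific tool):

```python
# Sketch (assumed data and estimator): tune the number of neighbors by
# 5-fold cross-validation over a discrete grid of parameter values.
import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsRegressor

rng = np.random.default_rng(0)
X = rng.uniform(0, 1, size=(200, 3))
y = X[:, 0] + 0.5 * X[:, 1] ** 2 + rng.normal(0, 0.1, size=200)

grid = {"n_neighbors": [1, 3, 5, 10, 20]}     # discrete grid of tuning values
search = GridSearchCV(KNeighborsRegressor(), grid, cv=5,
                      scoring="neg_mean_squared_error")
search.fit(X, y)
print("best parameters:", search.best_params_, "CV MSE:", -search.best_score_)
```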

CURSE OF DIMENSIONALITY: Flat World vs. 10D World

CURSE OF DIMENSIONALITY: Let us validate
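A quick way to validate the curse of dimensionality numerically (an assumed experiment, not the slides' own figures): in high dimensions, the nearest and farthest neighbors of a point become almost equally far away.

```python
# Sketch: distance concentration in higher dimensions. For points drawn
# uniformly in the unit cube, the ratio of nearest to farthest distance
# from a reference point approaches 1 as the dimension grows.
import numpy as np

rng = np.random.default_rng(0)
for d in (2, 10, 100):
    points = rng.uniform(size=(1000, d))
    query = rng.uniform(size=d)
    dists = np.linalg.norm(points - query, axis=1)
    print(f"d={d:3d}: min/max distance ratio = {dists.min() / dists.max():.3f}")
```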

Structural Risk Minimization via Regularization:
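The standard form of the regularized objective behind structural risk minimization (notation assumed): empirical risk plus a complexity penalty, with the trade-off controlled by a tuning parameter.

```latex
% Regularized empirical risk: \lambda controls the penalty on model complexity,
% e.g. \Omega(\theta) = \|\theta\|_1 (Lasso) or \|\theta\|_2^2 (ridge).
\[
\hat{\theta} = \arg\min_{\theta} \; \frac{1}{n} \sum_{i=1}^{n} \ell\bigl(f_\theta(x_i),\, y_i\bigr)
\;+\; \lambda \, \Omega(\theta)
\]
```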

Brief Description

Technology Overview

Hiring (What we’re looking for): http://blinqmedia.com/contact/job-openings/

Let’s work with Abalone
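A starting-point sketch for the Abalone exercise (assumes pandas/scikit-learn and that the UCI mirror of the dataset is still available at the usual URL; verify both):

```python
# Sketch: load the UCI Abalone data and fit a regularized linear model
# to predict the number of rings. The URL and column names follow the
# UCI documentation; treat them as assumptions to check.
import pandas as pd
from sklearn.linear_model import Lasso
from sklearn.model_selection import train_test_split

url = "https://archive.ics.uci.edu/ml/machine-learning-databases/abalone/abalone.data"
cols = ["sex", "length", "diameter", "height", "whole_weight",
        "shucked_weight", "viscera_weight", "shell_weight", "rings"]
df = pd.read_csv(url, names=cols)

X = pd.get_dummies(df.drop(columns="rings"), columns=["sex"])  # one-hot encode sex
y = df["rings"]
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = Lasso(alpha=0.01).fit(X_train, y_train)
print("test R^2:", model.score(X_test, y_test))
```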

Thank You!
