big data: challenges in agriculture - research park...

28
Big Data: Challenges in Agriculture Big Data Summit, November 2014 Moorea Brega: Agronomic Modeling Lead – The Climate Corporation

Upload: lamcong

Post on 22-Mar-2018

220 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Big Data: Challenges in Agriculture

Big Data Summit, November 2014 Moorea Brega: Agronomic Modeling Lead – The Climate Corporation

Page 2: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Outline

THE AGRICULTURAL

CHALLENGE

THE ROLE OF DATA SCIENCE

DATA SCIENCE MEETS

AGRICULTURE

Page 3: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

The Agricultural Challenge

Page 4: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

The Agricultural Challenge

Page 5: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Forty decisions, Forty outcomes

Page 6: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Key Decisions

equipment selection results analysis seed selection crop selection

fertility management water management

planting logistics planting practices

scouting inputs fertility

harvest logistics harvest timing grain marketing

Each season about 40 management decisions are made

Page 7: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Sequential Multi-Arm Bandit

Bayesian network

Each season about 40 management decisions made

many decisions

one outcome

Page 8: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Forty outcomes A typical farmer will manage 40 seasons

a typical farmer

manages 40 seasons

providing 40 outcomes

Page 9: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Data Science Meets Agriculture

Page 10: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

The Next Revolution?

GREEN REVOLUTION GREEN DATA REVOLUTION

INTENSIFY Apply breeding, fertilization

to increase yields.

OPTIMIZE Apply data science to optimize

management.

BIOTECH Marker assisted selection, traits,

chemistries, microbials.

BIOTECH REVOLUTION

Page 11: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Data Available One Crop, One Season, One Country

YIELD MONITOR DATA 14B OBSERVATIONS

REMOTE SENSING DATA 260B OBSERVATIONS

WEATHER DATA 20B OBSERVATIONS

Page 12: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Yield Modeling

yield genetics environment practices variability

y = f (g, e, p) + ε

Yield is a function of genetics, the environment & farming practices

Page 13: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Yield Optimization

OPTIMIZED YIELD Yield optimized for environment by optimization of genetics and management using predictive model.

YIELD Yield optimized for environment by optimization of genetics and management traditional practices.

Page 14: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Challenges in Applying Data Science

Page 15: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Challenges

Data Challenges • Spatio-Temporal Data

• Heterogeneous Data

• Missing Data

• Noisy Data

Learning Challenges • Latent Features

• Curse of Dimensionality

• Multi-task Learning

Page 16: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

2005

1950

1870

Multi-sensor Data high spatial resolution gridded data observed data

Reanalysis Data coarse spatial resolution gridded data produced by deterministic weather models

Gauge Data sparse spatially and temporally observed data

Data Challenges Spatial Misalignment

Page 17: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Data Challenges Spatial and Temporal Misalignment

Low resolution, higher temporal frequency

High resolution, lower temporal frequency

Page 18: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Data Challenges Heterogeneous Data

Page 19: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Data Challenges Missing Data

Yield data can be missing due to: ● pest/disease ● low yield due to

other causes (heat, drought, frost, ponding)

● equipment malfunction

● data post-processing by an outside entity

Page 20: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Data Challenges Noisy Data

Noise can come from many sources: ● Clouds and

atmospheric disturbance

● Equipment malfunction/equipment calibration issues

● Measurement error ● Human error ● Mislabeled data or data

with no labels

Page 21: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Inherent Complexity

Zea mays (corn)

Genetics, Environment, Practices

Soil Processes

Nutrient Processes

Crop Processes

YIELD

Page 22: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

The Role of Data Science

Page 23: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

The Role of Data Science A Coherent View of the Field

weather sensors remote sensors ground sensors

Utilizing multiple sources of data before and during the growing season to provide growers with insights and recommendations.

Page 24: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

The Role of Data Science Identifying Crop Stress

Insights When is the crop under stress? Recommendations What actions can I take to correct this?

Page 25: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

The Role of Data Science Nutrient Applications

Grower Practices

weather data

Insights How much nitrogen is available to my crop?

Recommendation How much fertilizer should I apply?

Page 26: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

The Role of Data Science

Bayesian network

Optimize each decision for risk adjusted return

step-wise optimization of conditional expected utility

Page 27: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

The Next Revolution?

GREEN REVOLUTION GREEN DATA REVOLUTION

INTENSIFY Apply breeding, fertilization

to increase yields.

OPTIMIZE Apply data science to optimize

management.

BIOTECH Marker assisted selection, traits,

chemistries, microbials.

BIOTECH REVOLUTION

Page 28: Big Data: Challenges in Agriculture - Research Park ...researchpark.illinois.edu/sites/researchpark.illinois.edu...Big Data: Challenges in Agriculture Big Data Summit, November 2014

Questions? [email protected]