magical methods for batch data processing

51
Potent Potions for Batch Data Processing

Upload: safe-software

Post on 12-Apr-2017

150 views

Category:

Technology


9 download

TRANSCRIPT

Page 1: Magical Methods for Batch Data Processing

Potent Potions for Batch Data Processing

Page 2: Magical Methods for Batch Data Processing

250,000 CAD files & rasters on mobile devices

Page 3: Magical Methods for Batch Data Processing
Page 4: Magical Methods for Batch Data Processing

Tip: Know Your Potions and Choose Wisely

Page 5: Magical Methods for Batch Data Processing

Today’s Potions

1. Wildcards

2. Batch Deploy

3. Parent/child Workspaces

4. Parent/child Server Workspaces

Page 6: Magical Methods for Batch Data Processing

Potion 1: One Wild<card>

Dataset

Page 7: Magical Methods for Batch Data Processing

Multi-Dataset Picker

Page 8: Magical Methods for Batch Data Processing

Multi-Dataset Picker

Page 9: Magical Methods for Batch Data Processing

Multi-Dataset Picker

Page 10: Magical Methods for Batch Data Processing

Multi-Dataset Picker

Shapefile MapInfo

Most rasters

DWG DGN SQLite

Page 11: Magical Methods for Batch Data Processing

Dataset Wildcards

Extended glob syntax:

Symbol Matches

? Any single character

* Any sequence of zero or more characters

[chars] Any single character in chars.

[a-d] Any character between a and d inclusive

{a,b,...} Any of the sub-patterns a, b

/**/ 0 or more subdirectories

Page 12: Magical Methods for Batch Data Processing

Time to brew Potion 1

Page 13: Magical Methods for Batch Data Processing

Potion 1: Enticements

Page 14: Magical Methods for Batch Data Processing

Wildcard Bulk Data Processing

Enticements ü  Simple to set up

ü Can transform across file boundaries

- Needs memory & time

Page 15: Magical Methods for Batch Data Processing

Potion 1: Pitfalls

Page 16: Magical Methods for Batch Data Processing

Wildcard Bulk Data Processing Pitfalls

x  Recovery from data errors difficult

x  Feature Type vs File vs Format Issues

x  No granular log x  No ability to

parallelize

Page 17: Magical Methods for Batch Data Processing

Potion 2: Batch Deploy

Page 18: Magical Methods for Batch Data Processing

Batch Deploy Script Writer

Page 19: Magical Methods for Batch Data Processing

Batch Deploy Script Writer

Page 20: Magical Methods for Batch Data Processing

Batch Deploy Script Writer

Page 21: Magical Methods for Batch Data Processing

Batch Deploy Script Writer

Page 22: Magical Methods for Batch Data Processing

Batch Deploy Script Writer

Page 23: Magical Methods for Batch Data Processing

Batch Deploy Script Writer

Page 24: Magical Methods for Batch Data Processing

Time to brew Potion 2

Page 25: Magical Methods for Batch Data Processing

Potion 2: Enticements

Page 26: Magical Methods for Batch Data Processing

Batch Deploy Enticements

ü  Simple to set up ü  Runs quickly ü  Can script via

command line ü  Run on demand

Page 27: Magical Methods for Batch Data Processing

Potion 2: Pitfalls

Page 28: Magical Methods for Batch Data Processing

Batch Deploy Pitfalls

x  Recovery from data errors difficult

x  No granular log x  Destination dataset

naming can be tricky

Page 29: Magical Methods for Batch Data Processing

Potion 3: Parent/Child Workspaces

Page 30: Magical Methods for Batch Data Processing

Parent/Child Workspace Ingredients

•  Parent Workspace: –  PathReader –  WorkspaceRunner

•  Child Workspace:

–  FeatureWriter

Page 31: Magical Methods for Batch Data Processing

Parent Ingredients

Page 32: Magical Methods for Batch Data Processing

Parent Ingredients

Page 33: Magical Methods for Batch Data Processing

Child Ingredients

Page 34: Magical Methods for Batch Data Processing

Time to brew Potion 3

Page 35: Magical Methods for Batch Data Processing

Potion 3: Enticements

Page 36: Magical Methods for Batch Data Processing

Parent/Child Workspace Enticements

ü  Separate transformation from workflow

ü  Generate audit logs ü  All authored within

Workbench

Page 37: Magical Methods for Batch Data Processing

Potion 3: Pitfalls

Page 38: Magical Methods for Batch Data Processing

Parent/Child Workspace

Pitfalls

x  Not all writers can be used concurrently

x  Slow to run each child workspace separately

x  Recovery from data errors not easy if concurrent runs used

Page 39: Magical Methods for Batch Data Processing

Potion 4: Parent/Child Server

Workspaces

Page 40: Magical Methods for Batch Data Processing

Parent/Child Server

Workspace Ingredients

•  Parent Workspace: –  PathReader –  FMEServerJobSubmitter –  FMEServerJobWaiter

•  Child Workspace:

–  FeatureWriter

Page 41: Magical Methods for Batch Data Processing

Parent Ingredients

Page 42: Magical Methods for Batch Data Processing

Parent Ingredients

Page 43: Magical Methods for Batch Data Processing

Parent Ingredients

Page 44: Magical Methods for Batch Data Processing

Child Ingredients

Page 45: Magical Methods for Batch Data Processing

Time to brew Potion 4

Page 46: Magical Methods for Batch Data Processing

Potion 4: Enticements

Page 47: Magical Methods for Batch Data Processing

Parent/Child Server

Workspace Enticements

ü  Separate transformation from workflow

ü  Generate audit logs ü  All authored within

Workbench ü  Make full use of

parallelism = FAST

Page 48: Magical Methods for Batch Data Processing

Potion 4: Pitfalls

Page 49: Magical Methods for Batch Data Processing

Parent/Child Server

Workspace Pitfalls

x  Not all writers can be used concurrently

x  Data needs to be accessible to Server Engines - Consider using Server Data Resources

x  Craft your reload/audit plan

Page 50: Magical Methods for Batch Data Processing

Summary ●  Many ways to handle bulk data moves

●  Choose your potion wisely - each has pluses and minuses

●  FME Server is the most robust automation choice

Page 51: Magical Methods for Batch Data Processing

Questions?

Batch processing tutorial: fme.ly/b59