Teaching Data and Computational Journalism


Table of Contents

Teaching Data and Computational Journalism
Preface
Executive Summary
Introduction
Chapter 1: Defining the Field of Study
  What's in a Name?
  Four Key Areas of Data Journalism
  A Brief History of Computers and Journalists
  The Task at Hand: Causes for Concern and Reasons for Hope
Chapter 2: State of the Field: Our Quantitative Data
  The Scope of Our Study
  Our Findings
  Teaching Data Fundamentals: Rows and Columns
  Teaching Advanced Data Skills: Visualization and Programming
  Alternative Data Instruction: The State of Online Courses
  Textbooks: Little Consensus
Chapter 3: Qualitative Findings: Interviews and Observations
  Identifying What to Teach
  The Coding Issue
  Institutional Challenges: Resources
  Institutional Challenges: Faculty Expertise
  Institutional Challenges: Student Engagement
Chapter 4: Model Curricula in Data and Computation
  Introduction and Summary of Curricular Recommendations
  Model 1: Integrating Data as a Core Class
  Model 2: Integrating Data and Computation into Existing Courses and Concentrations
  Model 3: Concentration in Data & Computation
  Model 4: Advanced Graduate Degree: Expertise-Driven Reporting on Data & Computation
  Model 5: Advanced Graduate Degree: Emerging Journalistic Techniques and Technologies
Chapter 5: Institutional Recommendations
  Faculty Development and Recruitment
  Trainings or Modules
  Incoming Skills, Technical Literacies, and Boot Camps
  Technology Infrastructure
  Benefits of Distance or Online Learning
  Fostering Collaboration
  Note on Specialist Faculty in Data and Computation
Appendix
Acknowledgments

This work, a collaboration between researchers from Columbia Journalism School and Stanford University, was made possible by a grant from the John S. and James L. Knight Foundation.

A professionally designed PDF is available here: http://bit.ly/cjsteachdata


Preface

The digital revolution ushered in fundamental changes in how information is structured. It also brought changes in how governments and corporations use information to exercise power. Governments now influence communities through the management of large data sets, such as in the allocation of services through predictive policing. They hold exclusive access to data that would help us to understand which policies are working, or how vulnerable populations are affected by the exercise of public policy. Corporations write opaque algorithms to determine who gets insurance at what price. These developments challenge journalism to move well beyond adaptation to social media or the adoption of new technologies for visualization. They implicate journalism’s public purpose. Encouragingly, a new facet of journalistic practice is emerging, adapting technology to reporting in the public interest.

This is an important reason why we must teach journalists to work with data: There are vital questions to be asked that require numeracy, and there are big stories to find and tell in new ways. The intellectual history of journalism reveals a continuous interrogation of emerging technologies for their relevance to the profession’s public purpose and concerns. We need journalists to be positioned to assess techniques like natural language processing and facial recognition for their relevance and promise as tools of reporting, as well as for their ethical dangers.

This is where journalism education may play a leadership role. Integrating computation, data science, and other emerging technologies into public-spirited reporting is an ideal mission for journalism schools. These schools can access the full resources of a university. The mission also relieves journalism educators of the risk of teaching perishable digital skills and perishable platforms. Data journalism curricula respond to objective change in the sheer amount of information that is stored digitally today – information that requires computation to access and interrogate. Teaching journalists to be literate about these changes and some to be specialists requires committing ourselves to using data, computation, and emerging technologies as essential tools of our profession.

Steve Coll
Dean & Henry R. Luce Professor
Columbia Journalism School


Executive Summary

Over the past century, journalism schools have developed solid foundations for teaching shoe-leather reporting techniques. Hundreds of universities teach how to interview, how to develop sources, how to cover a beat, and how to write a breaking news story, a feature, a sports dispatch, or an investigative piece.

But the practice of data journalism has been largely left out of the mainstream of journalism education, even as the field’s relatively small core of devotees has honed it into a powerful and dynamic area of practice. For decades, data journalists have competed for the profession’s highest prizes and secured positions of distinction within the most competitive news organizations, yet our research has found that relatively few journalism schools offer courses in this area, let alone a concentration, even as these schools have expanded instruction in presentation-focused digital skills.

The authors of this report believe that all journalism schools must broaden their curricula to emphasize data and computational practices as foundational skills. To place data journalism in the core of journalism education will mark a crucial advance in what schools can offer their students. Journalists who understand data and computation can more effectively do their jobs in a world ever more reliant on complicated streams of information.

Beyond teaching, too few journalism schools support faculty research into tools and techniques of data-driven reporting, despite rich opportunities for developing theories and applications that may change journalistic practice. Journalism schools that embrace research in their missions can transform themselves into innovation hubs, introducing new tools and techniques to the profession and across their universities, instead of merely preparing students to enter the field.

This report offers a snapshot of the state of data journalism education in the United States and outlines models for both integrating the use of data journalism into existing academic programs and establishing new degrees that specialize in data-driven and computational reporting practices. While we focus on the state of education in one country, we hope that the results may also be useful internationally.

But first, a definition. When we say “data journalism,” we mean using data for the journalistic purpose of finding and telling stories in the public interest. This may take many forms: to analyze data and convey that analysis in written form, to verify data found in reports, to visualize data, or to build news apps that help readers to explore data themselves. This field also encompasses the use of computation (algorithms, machine learning, and emerging technologies) to more effectively mine both structured and unstructured information to find and tell stories. The ability to use, understand, and critique data amounts to a crucial literacy that may be applied in nearly every area of journalistic practice.

We interviewed more than 50 journalists, educators, and students, and we evaluated more than 100 journalism programs across the nation. This report features a chapter detailing quantitative findings, such as the number of U.S. journalism programs offering classes in data, computation, and related tech skills. We also include a chapter of qualitative findings in which our interviews and classroom observations offer some color and texture to this picture of the present state of data journalism education and its potential.

Among our findings:

- Many journalism programs offer few courses in data journalism, and nearly half offer no classes at all.
- The classes offered are largely introductory, and the need is still largely for the basics, such as knowing how to use a spreadsheet, understand descriptive statistics, negotiate for data, and clean a messy data set and then “interview” it to find a story.
- The field offers a few foundational textbooks, but beyond that lacks a broad and strong core of literature to help teach both the history and practice of data journalism.
- Many journalism programs do not have a faculty member skilled in data journalism. Hiring professional journalists as adjuncts may pose many challenges, one of which is that job openings outnumber qualified applicants.
- Graduates with data journalism skills are better equipped to succeed, our interviews show. Faced with a decision to hire an entry-level reporter with no data skills or one who knows how to use a spreadsheet or query a database, the data skills provide a key edge.

Among our recommendations:

- Journalism schools can collaborate across the university to meet the burgeoning need for instruction in data and computation but should be wary of trying to outsource too much. While understanding how to do math, statistics, or computer programming is an important component, data journalism is much more than that.
- Journalism programs can integrate alternative teaching methods to help fill the gaps in their own faculty. Examples include cooperative teaching among different university departments, online courses, and independent tutorial packs.
- Journalism programs can choose among several models of instruction, all of which begin with a key component: at least one required class in analyzing data for stories, what historically has been termed computer-assisted reporting (CAR).
- Journalism schools that embrace both teaching and research into data journalism methods will be poised to fundamentally improve the way future journalists will inquire into matters of public interest and communicate with their audiences.

Following our findings, this report outlines several model curricula and general recommendations. We offer a model for a core, required course in data journalism. Then we suggest ways of introducing data and computation into existing journalism classes such as Ethics and Global Reporting. Next comes a set of full model curricula for degrees and concentrations in data and computational journalism. Finally, we address a range of institutional concerns on matters ranging from finding teachers to providing technological infrastructure.

Our objective is not to replace or diminish shoe-leather reporting in journalism instruction, but to augment it with data-driven and computational techniques. This report is meant to describe the state of data journalism education, to underline the urgency of incorporating these skills to equip the next generation of reporters, and to offer guidelines for moving journalism schools in this direction.


Introduction

Journalism schools have a long history of avoiding the call for instruction in quantitative skills. A little over a century ago, when the idea of teaching journalism at the college level was practically unthinkable, Joseph Pulitzer wrote an essay arguing for the potential and civic importance of journalism education, all as a response to several universities refusing the money he had hoped to donate in order to establish such a school. In this 1904 essay in the North American Review, Pulitzer outlined the skills he thought journalists would need in order to serve this lofty civic role. It was an ambitious list, highlighting law and ethics, history and literature, truth and accuracy, as well as a range of mathematical and scientific disciplines. Among these, Pulitzer specifically insisted on educating journalists in statistics.

“Everybody says that statistics should be taught. But how? Statistics are not simply figures. It is said that nothing lies like figures – except facts. You want statistics to tell you the truth. You can find truth there if you know how to get at it, and romance, human interest, humor and fascinating revelations, as well. The journalist must know how to find all these things – truth, of course, first.”[1]

This proposal for a statistics curriculum was largely left behind in the wave of journalism programs that were established in the twentieth century, including at Columbia, the school that bears Pulitzer’s name. Core reporting classes have taught students to gather, analyze, and present information, mostly through shoe-leather reporting and writing skills.

Data journalism and other quantitative reporting methods, on the other hand, have been developed largely in the field. Working journalists were the ones who first saw the potential of analyzing and presenting data, and of adopting tools such as spreadsheets and databases for stories, visualizations, and apps. Much of the instruction has come through professional workshops, not in classrooms.

To be sure, journalism programs have offered classes in research methodology, descriptive statistics, and basic numeracy. The Accrediting Council on Education in Journalism and Mass Communications (ACEJMC), which accredits roughly a fourth of the journalism programs in the nation, lists the ability to “apply basic numerical and statistical concepts” among the core competencies it expects its accredited schools to teach. This accreditation process is designed to be a voluntary process that helps schools maintain quality by meeting a set of national standards. Benchmarks such as statistical concepts are useful, but they don’t get to the heart of what it means to teach data journalism.


Many of the statistics and numeracy courses required for journalism majors are theoretical in nature, rather than journalistic. Despite the inclusion of basic statistics in schools of journalism and communications, the use of data and computation as applied to journalism has remained a set of niche practices, often omitted from journalism programs, our analysis shows.

Some of the early quantitative reporting methods that made the jump from practice to the classroom came with the move of key data journalists to academia. Over time, video and web-based multimedia, such as slideshows or timelines, were integrated into journalism instruction. Web and multimedia skills instruction now seems to outnumber data journalism instruction.

To get a better sense of what is being taught, we collected information on the course offerings of 113 programs located within the United States, including Puerto Rico. Four of the programs held a provisional accreditation and the remainder were fully accredited with ACEJMC. Chapter 2 of this report includes a longer discussion of our findings, while full tables of our data can be found in the appendix.

Of the 113 programs, 93 offer multimedia instruction: how to design a website, launch a blog, or shoot video for the Web. The average number of multimedia classes was 3. Far fewer journalism programs offer data analysis or visualization. A little more than half of these universities, 59 of the 113 schools we reviewed, regularly offer one or more data journalism classes. Among the 59 that teach data journalism, the average number of data journalism classes offered was 2.8, with a median of 2. The average across all 113 schools in our study was 1.4 data journalism classes each.

We defined a data journalism class as being focused on the intersection of data and journalism, and using spreadsheets, statistical software, relational databases, or programming toward that end. We excluded courses in numeracy, research methodologies, and statistics unless they included an explicit focus on data journalism.

The 59 programs we identified as offering data journalism included a wide range of courses. At a minimum, programs offered courses that taught students to use spreadsheets to analyze data for journalistic purpose. At the other end of the spectrum, some schools provided that basic data journalism instruction and far more, teaching multiple classes in programming skills, such as scraping the Web, building news apps, or creating advanced data visualizations.

But those more advanced programs were rare. Of the 59 programs we identified that teach at least one data journalism class, 27 of the schools offered just one course, usually foundational. Fourteen journalism programs offered two classes. Just 18 of the 59 schools offered three or more classes.
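The counts quoted in this Introduction can be cross-checked with a few lines of arithmetic. What follows is a minimal sketch in Python that assumes only the figures stated in the text; the variable and function names are illustrative and are not part of the study's data. It confirms that the three groups of programs add up to 59, and it estimates the overall average by spreading the classes taught at those 59 programs across all 113. The estimate lands a little under 1.5, close to the reported 1.4; the small gap presumably reflects rounding in the published averages.

```python
# Figures quoted in the Introduction; the names are illustrative, not the study's.
TOTAL_PROGRAMS = 113        # ACEJMC programs reviewed
WITH_DATA_CLASSES = 59      # programs offering at least one data journalism class
MEAN_AMONG_OFFERING = 2.8   # reported average among those 59 programs

# Reported breakdown of the 59 programs by number of classes offered.
breakdown = {"one class": 27, "two classes": 14, "three or more": 18}


def share(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# The three groups should account for every program that teaches data journalism.
groups_total = sum(breakdown.values())

share_offering = share(WITH_DATA_CLASSES, TOTAL_PROGRAMS)
programs_without = TOTAL_PROGRAMS - WITH_DATA_CLASSES

# Spread the classes taught at the 59 programs across all 113 programs.
implied_total_classes = WITH_DATA_CLASSES * MEAN_AMONG_OFFERING
implied_overall_mean = implied_total_classes / TOTAL_PROGRAMS

print(f"Groups add up to {groups_total} of {WITH_DATA_CLASSES} programs")
print(f"Programs offering data journalism: {share_offering}%")
print(f"Programs offering none: {programs_without}")
for label, count in breakdown.items():
    print(f"  {label}: {count} ({share(count, WITH_DATA_CLASSES)}% of the 59)")
print(f"Implied overall mean: {implied_overall_mean:.2f} classes per program")
```

Working from the published summary figures rather than the underlying table keeps the check self-contained, at the cost of inheriting their rounding.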


For those students who learn data journalism, a robust job market awaits. But when it comes to teaching data journalism, it’s difficult to find journalists to do it full time. Many work as adjuncts, but the pay is low. Additionally, even as some universities add classes in web development and coding, they have not kept pace with offering courses in computer-assisted reporting skills like learning how to analyze and understand data.

For advanced positions in data journalism (jobs that deal with statistics and mapping, novel forms of data visualization, rich online databases, and machine learning), little is available in the way of data journalism education preparation. Students who study both computer science and data journalism are well positioned to move into some of these more challenging jobs, but there is a dearth of such job candidates, say data journalists.

This paper will delve into the state of data journalism education today and present the lessons learned from those who have taught, studied, and practiced data journalism: what doesn’t work and should be abandoned, and what works and how it can be more widely adopted.

In the hope of providing practical guidance from leaders in the data journalism world, we will offer model curricula designed to reach a broad swath of educational institutions, from public land-grant universities to private institutions, for both undergraduate programs and graduate-level study, as well as possible concentrations and specialized efforts in graduate programs.

The goal throughout is to help journalism education move toward a more cohesive and thoughtful vision, one that will help to educate journalists who understand and use data as a matter of course and, as a result, produce journalism that may have more authority, yield stories that may not have been told before, and develop new forms of journalistic storytelling.

This vision of bringing data journalism into the mainstream of journalism education has yet one more, broader mission: improving the future of journalism education programs from a research perspective. The practice of data journalism (analyzing, sifting, and telling stories from information) will increase the contribution of journalism schools to the range of data-centered fields emerging across university campuses.

Data journalism “also can be a bridge to other parts of the university,” said James T. Hamilton, an economist and director of the journalism program at Stanford University, which in 2015 launched the Stanford Computational Journalism Lab. He pointed to possible collaborations with social scientists as just one example.

The authors and a committee of professors and professional data journalists agree that if journalists and journalism educators want to innovate, then equipping our students with practical data skills and, more importantly, a data frame of mind, is a vital part of the path forward for the students, faculty, and administrators.


1. Pulitzer, “Planning a School of Journalism,” p. 53.


Chapter 1: Defining the Field of Study


What's in a Name?

In our view, data journalism as a field encompasses a suite of practices for collecting, analyzing, visualizing, and publishing data for journalistic purposes. This definition may well be debated. The history of data journalism is full of arguments about what it should be called and what it includes.

In fact, data journalism has been evolving ever since CBS used a computer to successfully predict the outcome of the presidential election in 1952. As technology has advanced, so has the ability of journalists to tap that technology and use it for important storytelling.

One key definition of data journalism can be found in a 2014 report by Alexander Howard for the Tow Center for Digital Journalism and Knight Foundation. Data journalism is “gathering, cleaning, organizing, analyzing, visualizing, and publishing data to support the creation of acts of journalism,” Howard wrote. “A more succinct definition might be simply the application of data science to journalism, where data science is defined as the study of the extraction of knowledge from data.”[1]

But news games, drone journalism, and virtual reality (approaches that some may not consider mainstream data journalism today) may represent a much more dominant presence tomorrow. Or data journalism may evolve in yet another direction, perhaps into common applications for machine learning and algorithms. Data journalists are already working more with unstructured information (text, video, audio) as opposed to the historical elements of data journalism (spreadsheets and databases full of rows and columns of numbers).

“I think the one good thing about the name discussion is that people are realizing there are different kinds of approaches to data for journalism,” said Brant Houston, the Knight Chair in Investigative and Enterprise Reporting at the University of Illinois at Urbana-Champaign and a former executive director of Investigative Reporters and Editors (IRE).

The ever-evolving practice of data journalism has at heart represented what journalists do best: push against the boundaries of what is expected. Editors used to argue that readers wouldn’t understand a scatter plot published in the newspaper. Today, the New York Times’s Upshot, FiveThirtyEight.com, and others regularly provide informative data graphics and visualizations.

At the same time, each generation of data journalists has informed the next and balanced the desire to try new methods with foundational ethics and transparency.


Our study aims to provide a broad evaluation of many areas of journalistic practice involving data and to identify best practices for teaching these skills and the “data frame of mind” that goes with them. In doing so we looked at multiple forms of data journalism and defined them as best we could to ensure clear communication.

1. Howard, “The Art and Science of Data-Driven Journalism,” p. 4.


Four Key Areas of Data Journalism

For this report, we will divide data journalism into four categories, acknowledging that overlap is inevitable in practice. Examples of journalism that fall under each of these headings can be found in the appendix.

Data Reporting

Definition: Obtaining, cleaning, and analyzing data for use in telling journalistic stories.

Includes:

- Deploying computer-assisted reporting or analysis for writing journalistic stories
- Practicing precision journalism, as introduced by Philip Meyer, including the use of social science research methods in the interest of journalism
- Visualizing data (mapping and charting) for use in exploration and analysis
- Programming to obtain and analyze data for writing journalistic stories

Techniques and Technologies:

- Invoking public records law to negotiate for data
- Using web scraping tools and techniques (ranges from tools to knowledge of Python programming language)
- Using relational database software (can range from Microsoft Access to MySQL)
- Understanding statistical concepts and software or programming languages with statistical packages (SPSS or R among others)
- Using mapping and visualization tools and software (Tableau, Esri mapping software, QGIS, Google Fusion)

Data Visualization and Interactives

Definition: Using code for digital publishing (HTML/CSS/JavaScript/jQuery) as well as programming and database management to build interactive journalistic work. This overlaps with design work, which falls outside of traditional definitions of data journalism. But visualizations and apps also can be integral to the storytelling process.

Includes:

- Visualizations developed and designed as interactive charts and graphics for presentation, including the use of code
- Interactive applications, including searchable databases and games that help readers explore and understand a news story; these applications can be a key part of the utility of a data journalism project

Techniques and Technologies:

- The use of code, which is defined as HTML and CSS and also could include JavaScript
- The use of visualization software or programs, ranging from Tableau visualizations to the D3 JavaScript library
- Database management and programming, including Python, web frameworks such as Django, Flask, and Ruby on Rails, and more
- Mapping applications, including QGIS, CartoDB, Esri, TileMill, GeoDjango, and more
- Server knowledge and the use of GitHub, versioning, and Agile software development techniques

Emerging Journalistic Technologies

Definition: New developments using data and technology.

- Drone Journalism
- Sensor Journalism
- Virtual and Augmented Reality Journalism

Drone Technologies:

“Drone journalism is generally defined as the use of unmanned aerial systems to gather photos, video and data for news. What separates Drone Journalism from drone photography is the application of journalistic ethics and consideration of the public interest when using [drones].” – Matt Waite, a professor of practice at the University of Nebraska and founder of the Drone Journalism Lab

Drone technologies can include an airframe, defined by configuration (such as fixed wing or multirotor); an autopilot of varying capabilities (full automation, minor stability assistance, return-to-home fail-safe functionality); a control system (manual control through radio signals, automated flight through software and Bluetooth wireless connection); and a sensor (camera, video camera, multispectral camera, other physical sensor).

Sensor Technologies:

Sensor technologies include a wide range of software and hardware to measure physical conditions like air quality, motion, or noise levels. These can be used to gather data with a small, portable computer or microcontroller. The Raspberry Pi is a low-cost, credit card-sized computer that has a variety of input/output pins for mounting devices like sensors. Similarly, Arduino is an open-source microcontroller platform that is widely used for prototyping with electronic components like sensors. Some universities have already begun teaching sensor journalism with specific project-based classes, such as to test environmental conditions like air and water quality.

Virtual and Augmented Reality Technologies:

Virtual reality (VR), long heralded as an emerging digital technology, finally appears poised to enter the broad consumer market. Samsung, Oculus, and Google have developed consumer VR headsets along with controllers to facilitate interactivity using your hands and feet. From a production standpoint, panoramic images and videos may be stitched together from an array of cameras, while the company Jaunt is developing a standalone camera to capture 3D video in 360-degree, immersive format. Yet questions of narrative, audience interaction, and journalistic values have yet to be settled with these technologies, even as the New York Times, Los Angeles Times, and PBS “Frontline” have launched exploratory ventures to use VR. Journalism schools need to not only provide exposure and instruction in this emerging technology, but also to inquire into values and best practices.

Computational Journalism

Definition: The use of algorithms, machine learning, and other new methods to accomplish journalistic goals. This area overlaps with data reporting and emerging technologies.

Includes:

- Algorithms that help journalists mine unstructured data in new ways
- New digital platforms to better manage documents and data

Technologies:

- Programming languages like Python, Ruby, and R
- Frameworks and applications like Jupyter that enable journalists to mix code and prose as they perform analysis and show the steps in their work
- Platforms like Overview that facilitate the use of complicated computational processes like natural language processing and topic modeling


A Brief History of Computers and Journalists

In 1967, Philip Meyer had just returned to Knight Ridder’s Washington Bureau from a Nieman Fellowship at Harvard University, where he had delved into a different area of computational methods: social science. Social science methodologies, including statistical tests and surveys, had recently been used by academics to detail the reasons behind the 1965 Watts riots in Los Angeles. Meyer believed similar methodologies could have great impact in journalism. He wasn’t back at work for long when he was able to put that belief into practice.

In July 1967, an early morning raid of an unlicensed bar in Detroit resulted in rioting. Crowds of people ran through the streets, burning, looting, and shooting. Theories abounded as to why the rioting had occurred. Some experts thought it was done by those “on the bottom rung of society” with no money or education. A second theory was that it was caused by transplanted and unassimilated Southerners.

Meyer, on loan to Knight Ridder’s Detroit Free Press, reached out to friends who were social scientists to devise a survey, cobble together funding, and train interviewers. In the survey, respondents, who were guaranteed anonymity, were asked to assess their own level of participation in the riots. They were also asked to indicate whether they considered rioting a crime, whether they supported fines or jail for the looters, and whether they considered African Americans in Detroit to be better off than those elsewhere.

The survey results contradicted the earlier theories and pointed to a different explanation: that the relative good fortune of many African Americans highlighted more deeply the gap felt by those who were left behind.

The Free Press’s coverage of the rioting, including Meyer’s “swift and accurate investigation into the underlying causes,” won the Pulitzer Prize for Local General Reporting in 1968 and launched a new era in the use of computational methods in the service of journalism. Meyer’s seminal book, Precision Journalism: A Reporter’s Introduction to Social Science Methods, was published in 1973 and argued that journalists trained in social science methods would be better equipped for journalistic work and provided guidelines for journalists to understand those methods.[3] “The tools of sampling, computer analysis, and statistical inference increased the traditional power of the reporter without changing the nature of his or her mission,” Meyer wrote, “to find the facts, to understand them, and to explain them without wasting time.”[4]


That pioneering work by Meyer is commonly thought to be the beginning of what has been termed either precision journalism or computer-assisted reporting. His approach inspired other journalists. Their work in turn inspired a movement and the creation of a training ground. Two academic institutions in particular, Indiana University and the University of Missouri, supported the development of that training ground.

But in the wider academic world, computational methods applied to reporting largely did not have an impact on other university programs or how journalism was taught. Instead, professional journalists taught other professional journalists the new techniques, and only as those data journalists began to enter academia did data journalism education begin to take a wider hold in that setting.

By the 1980s, as desktop personal computers took the place of typewriters, and editing terminals were used with digital publishing systems, reporters began to use software on PCs to great effect. In 1986, Elliot Jaspin, a reporter at the Providence Journal-Bulletin, used databases to match felons and bad driving records to school bus drivers.

In 1988, Bill Dedman, a reporter for the Atlanta Journal-Constitution, using data from a 9-track tape and with analysis by Dwight Morris and input from the Hubert H. Humphrey School of Public Affairs at the University of Minnesota, showed that banks were redlining African Americans on loans throughout Atlanta, and eventually the country, while providing services in even the poorest white neighborhoods. That series, “The Color of Money,” won a Pulitzer Prize in Investigative Reporting.

By 1989, Jaspin launched the Missouri Institute for Computer-Assisted Reporting (MICAR) at the University of Missouri. Soon, he was teaching computer-assisted reporting to students at the university and holding boot camps for professional journalists. Five years later, in 1994, a Freedom Forum grant would help the institute boost its presence and become a part of IRE as NICAR, the National Institute for Computer-Assisted Reporting.

In 1990, at Indiana University, former journalist turned professor James Brown worked with IRE to organize the first computer-assisted reporting conference, sponsored by IRE. He created a fledgling group called the National Institute for Advanced Reporting (NIAR).

“Andy Schneider, a two-time Pulitzer winner, had just joined our faculty as the first Riley Chair professor. One day we were talking about how so few journalists used computers in their reporting,” Brown recalled in an email. “In 1990, I don’t know of any schools that had such skills integrated into the curriculum. At that time, any undergraduate in even the smallest school of business knew how to use a spreadsheet. We decided to do something about it and that was how NIAR started.”

NIAR would host six conferences before deciding to fold to avoid duplicating efforts by IRE and MICAR, Brown said. Still, the Indiana conferences trained more than 1,000 journalists and were a precursor to a new era. In 1993, IRE and MICAR (which later would be renamed NICAR) held a computer-assisted reporting conference in Raleigh, North Carolina, that drew several hundred attendees. That marked the beginning of an annual event that continues today, where new generations of reporters and editors learn to use spreadsheets or query data and to use maps and statistics to arrive at newsworthy findings.

In 1993, the same year as the Raleigh computer-assisted reporting conference, the Miami Herald received the Pulitzer Prize for Public Service after reporter Steve Doig used data analysis and mapping to show that weakened building requirements were the reason Hurricane Andrew had so devastated certain parts of Miami.

Much of this new computer-assisted reporting came about because, as the Internet emerged and became more accessible, so too did the concept of using a computer in reporting. But NICAR and the University of Missouri in particular had a broad and deep impact. A good number of the most prominent practitioners of data journalism learned their skills from NICAR and from other journalists trying to solve similar data challenges.

This pattern is perhaps most visible through tracking the careers of the NICAR trainers themselves. Sarah Cohen was part of a Washington Post team that received the 2002 Pulitzer Prize in investigative reporting for detailing the District of Columbia’s role in the neglect and death of 229 children in protective care, and Jennifer LaFleur has won multiple national awards for the coverage of disability, legal, and open government issues. Both were NICAR trainers.

Another NICAR trainer was Tom McGinty, now a reporter at the Wall Street Journal and the data journalist for “Medicare Unmasked,” which received the 2015 Pulitzer Prize in Investigative Reporting. Jo Craven McGinty was also a NICAR trainer and later worked as a database specialist at the Washington Post and at the New York Times; she now writes a data-centric column for the Wall Street Journal. Her analysis of the use of lethal force by Washington police was part of a Post series that received the Pulitzer Prize for Public Service and the Selden Ring Award for Investigative Reporting in 1999.

Journalist David Donald moved on from his NICAR training role to head data efforts at the Center for Public Integrity and is now data editor at American University’s Investigative Reporting Workshop.

Aron Pilhofer was an IRE/NICAR trainer and led IRE’s campaign finance information center. He went on to work at the Center for Public Integrity and the New York Times, where he founded the paper’s first interactives team. Today, Pilhofer is digital executive editor at the Guardian.

Justin Mayo, a data journalist at the Seattle Times, graduated from the University of Missouri and worked in the NICAR database library and as a NICAR trainer. He has paired with reporters on work that has opened sealed court cases and changed state laws governing logging permits. Mayo was involved in data analysis and reporting on an investigative project on problems with prescription methadone policies in the state of Washington, which received a Pulitzer Prize for Investigative Reporting in 2012, and in coverage of a mudslide that received a Pulitzer Prize for Breaking News Reporting in 2015.

Clearly, working at NICAR has meant building powerful skills. So, too, has attending conferences and boot camps. The students who attended early NICAR boot camps were “missionaries” who returned to their newsrooms to teach computational journalism skills to their colleagues, Houston recalled. For years, the conferences and boot camps were “the only place where people have had an extensive amount of time to try out new techniques.”5

By the late 1990s, as the increasing prominence of the Internet led more news organizations to post stories online, journalism education offered even more digitally focused instruction: multimedia, online video skills, and HTML coding, among others.

Two strands, data and digital, represent distinct uses of computers within journalism. Early calls for journalism schools to adapt to changing technological conditions were answered mainly with the addition of digital classes: learning how to build a web page, create multimedia, and curate content.

Many of the early digitally focused journalism instructors faced a battle in trying to introduce new concepts into print journalism traditions. Data journalism instructors, focusing more on data analysis for use in stories, have faced similar challenges.

Meanwhile, by the 1990s, a few universities had begun teaching data analysis for storytelling. Meyer, who in 1981 became Knight Chair at the University of North Carolina, was teaching statistical analysis as a reporting method. Indiana University, with Brown, the professor who launched the first CAR conference, began incorporating the methods into classes. And Missouri offered computer-assisted reporting instruction, thanks to Jaspin; Brant Houston, an early NICAR director who later became IRE’s executive director; and others. Other universities began to introduce basic classes or incorporate spreadsheets into existing classes.

Houston’s Computer-Assisted Reporting: A Practical Guide became one of the few foundational texts available on the subject. His book, now in its fourth edition, lays out the basics of computer-assisted reporting: working with spreadsheets and database managers as well as finding data that can be used for journalism, such as local budgets and bridge inspection information. What Houston detailed in that first edition became essentially a core curriculum for data journalism from 1995 through the present day. Houston’s work codified the principles and practices of computer-assisted reporting from the perspective of its burgeoning community.

But throughout those two decades, journalists still learned these skills primarily through the NICAR conferences or from other journalists. For many years, for example, Meyer and Cohen taught a NICAR stats and maps boot camp at the University of North Carolina geared toward professional journalists.

Since then, boot camps have become a popular model, used by universities and other journalism training organizations, often in coordination with IRE/NICAR. A key tenet of the boot camp is practical, hands-on training, using datasets that journalists routinely report on, such as school test scores. To sum up this model, Houston said it’s all about “learning by doing.”

Many boot camp graduates have gone on to robust data journalism careers and have also moved into teaching in journalism programs, both as adjuncts and full-time faculty, where they have integrated those teaching techniques into their classes. These journalists essentially took the curriculum from NICAR and introduced it into the wider academic world.

In 1996, Arizona State University lured Doig from the Miami Herald to the academic life, where he has been teaching data journalism ever since, serving as the Knight Chair in Journalism and specializing in data journalism. The stats and maps boot camp eventually migrated to ASU as well.

As journalism programs began to offer these classes, they focused on the basics covered in Houston’s book: negotiating for data, cleaning it, and using spreadsheets and relational databases, mapping, and statistics to find stories.

In 2005, ASU benefited from a push by the Carnegie Corporation of New York and the John S. and James L. Knight Foundation to revamp journalism education. The school expanded its focus on all things data and multimedia with the founding of News21. That program has focused heavily on using data to tell important and far-reaching stories while teaching hundreds of students journalism at the same time.

At Columbia, the first course on computer-assisted reporting was offered in 2003, when Tom Torok, then data editor at the New York Times, taught a one-credit elective. With the founding of the Stabile Center for Investigative Journalism in 2006, some data-driven reporting methods were integrated into the coursework for the small group of students selected for the program. The number of offerings in data and computation at Columbia has risen steadily since the founding of the Tow Center for Digital Journalism in 2010 and the Brown Institute for Media Innovation in 2012. In addition to research and technology development projects, these centers brought full-time faculty and fellows to teach data and computation, as well as supplied grants to support the creation of new journalistic platforms and modes of storytelling.

Columbia has also launched several new programs in recent years that situate data and computational skills within journalistic practice. One is a dual-degree program in which students simultaneously pursue M.S. degrees in both Journalism and Computer Science; those students must be admitted to both programs independently. In 2014, the Columbia Journalism School established a second data program, The Lede, in part to aid students in developing the broad skill set they would need to be competitive applicants to both Journalism and CS. The Lede is a non-degree program that provides an intensive introduction to data and computation over the course of one or two semesters. Most students arrive with little or no experience with programming or data analysis, but after three to six months they emerge with a working knowledge of how databases, algorithms, and visualization can be put to narrative use. Post Lede, many students are competitive applicants for the dual degree, but others go directly into the field as reporters.

The emergence of these initiatives in journalism schools reflects the extent to which data-driven reporting practices have broadened in the last decade. In the 2000s, journalists began to move well beyond CAR, trying out advanced statistical analysis techniques, crowdsourcing in ways that ensured data accuracy and verification, web scraping, programming, and app development.

In 2009, IRE began working to attract programmers and journalists specializing in data visualization, said executive director Mark Horvit. It had always offered hands-on sessions in analyzing data, mapping, and statistical methods. Added to that now are sessions on web scraping, multiple programming languages, web frameworks, and data visualization, among other topics. The sessions have even included drone demonstrations. The challenge has become balancing the panels so that there is enough of each type of data journalism. As a result, the annual conferences have grown tremendously, from around 400 attendees at the CAR conference each year in the early 2000s to between 900 and 1,000 today.

Other groups began addressing data journalism as well as pushing for new methods of digital journalism. The Society of Professional Journalists wanted to teach its members about data and joined with IRE to do so, sponsoring regional two- or three-day Better Watchdog Workshops. Minority journalism associations began to provide data journalism training, often in collaboration with IRE or its members or under the Better Watchdog theme.

The Online News Association’s annual conference focuses on the larger world of digital journalism. Many of its panels feature coding for presentation, cutting-edge developments in digital web-based products, audience development, and mobile. It also offers panels on data journalism and programming.

Still, a gap has persisted. At times, new organizations formed to fill some of the needs. In 2009, Pilhofer, then at the New York Times, Rich Gordon from Northwestern University, and Associated Press correspondent Burt Herman, who was just finishing a Knight Fellowship at Stanford, created a loosely knit organization that brings together journalists and technologists, hence the name Hacks/Hackers. Its mission is to create a network of people who “rethink the future of news and information.” Even as some groups have tried to fill gaps in data journalism instruction, what exactly counts as data journalism remains a rough boundary, with few distinctions between data journalism and digital/web skills. In this paper, we continue to sharpen the focus on what will improve the level of data journalism education, not overall digital instruction.

In 2013, a group of journalists used Kickstarter to raise $34,000 and create ForJournalism.com, a teaching platform to provide tutorials on spreadsheets, scraping, building apps, and visualizations. Founder Dave Stanton said the group wanted to focus on teaching programmatic journalism concepts and skills and offer subjects that weren’t being taught. “You didn’t really even have these online code school things,” he said. “There were a few. The problem was there was no context for journalism.”

2. In later editions, the name changed to The New Precision Journalism (2013).

4. Meyer, Precision Journalism, p. 3.

5. For a more complete look at the long and storied history of computer-assisted reporting, see the spring/summer 2015 edition of the IRE Journal, which provides a detailed and engaging recounting by Jennifer LaFleur, NICAR’s first training director in 1994 and now the senior data editor at the Center for Investigative Reporting/Reveal. Brant Houston details that history in “Fifty Years of Journalism and Data: A Brief History,” Global Investigative Journalism Network, November 12, 2015.

The Task at Hand: Causes for Concern and Reasons for Hope

With data coursework lacking in so many schools, the strongest presence of data journalism in most of academia has been the study of changing newsrooms by sociologists and communication scholars. Their work aims to document and explain data practices within ongoing scholarly conversations about media, technology, information, and society.

Elsewhere in academia, narrative uses of data and computation have emerged independently. Besides the work of quantitative social scientists, like those who inspired the work of Meyer, significant movements in the arts and humanities treat data either as a novel inroad to their traditional objectives or as a means to reinterpret those objectives. Probably the broadest of these movements falls under the heading of the “digital humanities.” One of its leading figures, Franco Moretti of Stanford University’s English department, has developed methods of “distant reading” by which one asks questions of a set of books larger than any one person could read in a lifetime. Dennis Tenen, a professor in Columbia University’s English and Comparative Literature department who has also taught at the Journalism School, identifies himself as a practitioner of computational cultural studies and argues that most disciplines have by now developed computational methods that have either complemented or supplanted their earlier practices.6

Several universities have founded centers and institutes devoted to work at the nexus of data, computation, and humanistic endeavors. The University of Illinois, Urbana-Champaign, for instance, hosts the Institute for Computing in Humanities, Arts, and Social Sciences, or I-CHASS, a partnership between the university and the National Center for Supercomputing Applications. The institute helps develop partnerships among social scientists and computing experts, engineers, data scientists, and computer scientists. Their collaborations have included work on large-scale video analysis, research into climate change, and even digitizing and analyzing the papers of Abraham Lincoln.

The uses of data and computation in architecture, geography, and economics also reflect the manner in which these disciplines adopted new tools and methods in recent decades. In journalism, our history is not so different. Like data journalism, computational work in the humanities and social sciences is growing, and this is reflected in the relatively healthy academic job market for digital humanists compared with the job market for traditional scholars.

Overall, we see data science and computational methods being introduced into disciplines across universities that, like journalism, have not been particularly quantitative in the past. Practices involving the use of data and computational methods may be bundled into entirely new departments, centers, research institutes, and degree programs (such as data science and computational media). It is not the purpose of a program in data journalism to compete with these other disciplines, but to develop a curriculum that is intrinsically journalistic, one that reflects a mission to find and tell stories in the public interest, as well as to develop partnerships and collaborations with other disciplines.

One example of unexpected interdepartmental collaboration at Columbia has been with the Earth Institute, which has curated a massive database of climate data and offers courses in Python programming in which several Journalism students have enrolled. The coursework focuses on large time-series datasets, which enables data journalists to put the climate into context in their stories.

In 2013, Jean Folkerts, John Maxwell Hamilton, and Nicholas Lemann (all journalism school deans, and two of the three longtime professional journalists) published “Educating Journalists: A New Plea for the University Tradition.” The paper focused on “universities’ role in journalism as a profession” but it also discussed how this transformation in journalism could be a boon for the schools that educate journalists. The authors wrote:

That journalism is going through profound changes does not vitiate, in fact it enhances, the importance of journalism schools’ becoming more fully participant in the university project. Done properly, that will produce many benefits for the profession at a critical time. Journalism schools should be oriented toward the future of the profession as well as the present, and they should not be content merely to train their students in prevailing entry-level newsroom practices.7

Key among their recommendations was this: “We see all three of these early strains in journalism education (practice-oriented, subject matter-oriented, and research-oriented) as essential. And all of them can and should be applied, with potentially rich results, to the digital revolution. Journalism schools should embrace all three, not choose one and reject the others.”8

Journalism programs, with their ability to communicate to a general audience and their potential to analyze and visualize data for stories, are a perfect partner for other departments. For example, at Stanford’s new Computational Journalism Lab (co-founded by one of this report’s authors), faculty are working on several projects with professors from other academic disciplines whose research mission touches on the same data. One goal is that datasets can be collected, analyzed, and used in academic research as well as for journalistic storytelling. In some instances, new methods of analysis can be developed in concert with important public accountability journalism projects.

Talk to deans of journalism schools today and you will hear the same refrain and the belief that data journalism, while not a savior, is an increasingly important component of how journalism education can evolve.

Steve Coll, the dean of the Columbia Graduate School of Journalism, describes the emergence of instruction in data-driven reporting practices as a recognition that data journalism is not just about publishing stories through digital media, but about developing reporting methods appropriate to the complexity of the world today.

“Data journalism and tools like sensors look powerful because, in comparison to the way journalism schools have responded to previous iterations of technological change, this one runs deep, and to the heart of professional practice. It’s not about shifting distribution channels, or shifting structures of audience,” Coll said. “It was very tempting, in many ways necessary, for journalism schools to rush over to the teaching of tools, the teaching of platforms, the teaching of changing audience structure. But that transformation often had little to do with the core, enduring purpose of journalism, which is to discover, illuminate, hold power to account, explain, illustrate.”

Journalism schools, by necessity, adopted many new tools to respond to the massive and rapid shift to digital media. But delving into data journalism brings journalism back to its journalistic mission and moves it ahead in its research mission at the same time, Coll said.

“What we’re really seeing now is that this is a durable change in the structure of information, and therefore a need to durably change a journalist’s knowledge in order to carry out their core democratic function. Not to build a business model, not to reach more people, not to have more followers, but to actually discover the truth, you need to learn this.”

The rise of data analysis may also foster cross-campus collaboration. Journalism schools, as they embrace data analysis within their already powerful ability to tell stories, are uniquely suited to be robust participants and even leaders in developing means of storytelling with data.

Our research, which is focused on journalism schools, may not account for programs where data analysis is centered in another school or department that teaches this subject to students throughout the university. For undergraduates, in particular, there is little reason to offer in-house classes in subjects that students have free rein to study in another department. Yet it would require a great deal of latitude and initiative for students to construct hybrid degrees this way. Journalism students can sometimes be better served by cross-departmental initiatives that pair instructors for team teaching and connect journalism students with other disciplines that focus on data and computation. Northwestern, Stanford, Boston University, Columbia, Georgia Tech, Syracuse, and others have worked to build these interdisciplinary initiatives.

By establishing these interdepartmental bridges, schools can create pathways of collaboration among journalism, its partner disciplines of communication and media studies, and the other areas of research that share an interest in the future of technology and society.

Even as cross-departmental work increases, another challenge for journalism education will be to identify which data courses need to be framed journalistically and which others can be learned through classes framed within the methodology of other departments. In order to learn statistics, for example, students may be encouraged to register in classes offered by the math, statistics, or even political science department. The principles and objectives of these classes could apply within journalistic work, but that may not always be the case. These classes are often taught from a research or theoretical perspective. A statistics class that emphasizes survey methodology, for example, could be less useful for a journalism student.

Journalists do not often work with samples; more often they work with entire datasets. For data journalism education in particular, a more useful statistics class might be the type of instruction Meyer provided both in college courses and in IRE/NICAR boot camps, using social science to address journalistic challenges. Accommodating both techniques in a research or statistics class could foster collaboration instead of silos. In other instances, outsourcing a course may make sense. Mapping skills necessary for journalists, for example, are the same types of skills necessary for other disciplines in academia.

Yet the task of developing and adopting a data journalism curriculum comes with its own challenges. The high rate of change in digital tools, platforms, and programming languages means that there is more to teach and that classes themselves must be updated frequently. It is difficult to decipher which new techniques are just passing fads and which have the potential to remain relevant for even ten years. For this reason, it is important for classes to be designed so that they teach data and computation as fundamental styles of inquiry. Students can learn enough about the concepts behind a technique to be able to more easily learn new tools that address the technique, as opposed to focusing on the discrete tools used from time to time.

There are exceptions: the Unix command line, for example, has been as fundamental and immutable as any computing tool. This is a text-based application, still favored by developers for many tasks on Mac and Linux systems, for controlling the computer using typed commands instead of a graphical interface. And many of its core utilities remain essentially unchanged since the 1970s. Yet it is far more common to cite such examples as the ActionScript language for Adobe Flash, which was taught at several journalism schools less than a decade ago and is all but abandoned by developers today. The silver lining is that ActionScript shares many features with programming languages such as JavaScript and Python, so it may have offered a path for a student to develop other proficiencies. But it also highlights the importance of selecting techniques for journalism classes with long-term considerations in mind.

6. Franco Moretti, Graphs, Maps, Trees: Abstract Models for Literary History (London: Verso, 2007) and Dennis Tenen, “Blunt Instrumentalism,” in Debates in the Digital Humanities, forthcoming in 2016, University of Minnesota Press.

7. Folkerts, Hamilton, and Lemann, “Educating Journalists,” p. 4.

8. Ibid., p. 12.

Chapter 2: State of the Field: Our Quantitative Data

The Scope of Our Study

For this report, we collected and analyzed information on 113 journalism schools, roughly one-quarter of the nation’s journalism programs, and gathered 63 syllabi for courses on topics spanning data-driven journalism, computational journalism, data visualization, and other methods. We combined that with a series of in-depth interviews with more than 50 professors and professional journalists (many of whom are adjuncts), and we spoke with ten students or recent graduates. We also attended nine classes and participated in three massive open online courses (MOOCs).

For years, anecdotal evidence has indicated that U.S. journalism schools have fallen behind in data instruction, or rather, started from behind and have not caught up with the field as it has been practiced in newsrooms. A key tenet of this field is that using data to report and tell stories can result in a more powerful story. As LaFleur described it in her IRE article: “understand the data, interview the data, report the data.” That is the process we tried to follow for this report.

We first collected the course offerings of 113 programs accredited (fully or provisionally) by the Accrediting Council on Education in Journalism and Mass Communications (ACEJMC). Accreditation is a voluntary process for journalism schools. We used the ACEJMC programs simply because they represented a significant portion of journalism schools and their curriculum requirements include two that fit in with the concept of providing data journalism instruction: “apply basic numerical and statistical concepts” and “apply current tools and technologies appropriate for the communications professions in which they work, and to understand the digital world.”

We scraped what we could from the journalism program websites and hand-entered the remainder. To verify the data, we then emailed or called programs that had listed either no classes in data journalism or very few classes. This yielded changes in our numbers for several programs where the online course descriptions were not accurate. In soliciting this feedback, we also heard from 11 schools where the department is revamping its curriculum and considering adding data journalism. Sixteen schools did not respond to multiple emails or phone calls. We then revisited every program website for all 113 programs and double-checked the data.1

We also collected information on multimedia offerings of each program so that we could compare multimedia course offerings with data journalism course offerings.

1. It should be noted that information on a degree program’s website does not necessarily reflect the present state of its curriculum. We reached out to professors and administrative staff in order to confirm our data, but this was not always possible.

Our Findings

A little more than half of the universities we reviewed (59 of the 113 schools) offer one or more data journalism courses. We defined a data journalism class as being focused on the intersection of data and journalism, and using spreadsheets, statistical software, relational databases, or programming toward that end. We included in the data journalism category only those programming classes that went beyond basic HTML and CSS. For the purposes of this report, we considered classes on HTML, CSS, and JavaScript to be focused on digital/design journalism, not data journalism. We also excluded courses in numeracy and communications research methodologies and statistics unless the course offerings explicitly included a journalism focus. The appendix includes tables detailing the full results of our analysis.

For Aaron Williams, who is four years out of college, it was not surprising to hear that our analysis showed 54 of the 113 programs don’t offer a standalone class on data journalism. Williams has worked in data journalism at the Los Angeles Times and the Center for Investigative Reporting, and is now interactive editor at the San Francisco Chronicle. Almost everything he knows he learned from colleagues at NICAR, he said. “I didn’t even really know about data journalism as a discipline, nor did my instructors … until basically I was a senior,” Williams recalled.

Of the 59 programs we identified that teach at least one data journalism class, 27 of the schools offer just one course, usually foundational. Fourteen offer two classes. Just 18 of the 59 schools teaching data journalism offer three or more classes in this subject.
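
These counts can be cross-checked with simple arithmetic. The short Python sketch below is an illustration added for readers, not part of the study's own analysis. It uses only the figures reported in this section; the variable and function names are hypothetical.

```python
# Counts as reported in this section. Names are illustrative only and do not
# come from the study's own code.
TOTAL_PROGRAMS = 113
courses_offered = {"one course": 27, "two courses": 14, "three or more": 18}

# Programs with at least one standalone data journalism class.
with_data_class = sum(courses_offered.values())
# Programs with no standalone data journalism class.
without_data_class = TOTAL_PROGRAMS - with_data_class


def share(count, total):
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100 * count / total, 1)


print(f"With a data class: {with_data_class} "
      f"({share(with_data_class, TOTAL_PROGRAMS)}% of all programs)")
print(f"Without a data class: {without_data_class} "
      f"({share(without_data_class, TOTAL_PROGRAMS)}% of all programs)")
for label, count in courses_offered.items():
    print(f"  {label}: {count} "
          f"({share(count, with_data_class)}% of programs with a data class)")
```

The three categories sum to the 59 programs cited above, and the remainder matches the 54 programs reported as lacking a standalone class.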

At a minimum, these programs offer courses that teach students to use spreadsheets to analyze data for journalistic purposes. At the other end of the spectrum, some schools provide far more, teaching multiple classes in programming skills, such as scraping the Web, building news apps, or creating advanced data visualizations. But programs with multiple classes are rare.

A significant number of programs offer some instruction in data journalism, even if they don’t provide a standalone class. Of the 113 ACEJMC-accredited programs, 69 integrate some data journalism into other reporting and writing courses, our analysis showed. In most cases, this entails introducing the concepts of using spreadsheets or basic analysis as part of reporting and writing classes or certain topic classes, such as business journalism.

Again, tables summarizing these findings can be found in the appendix, while the remainder of this chapter will dig deeper into our analysis of syllabi and course offerings in data journalism.

Teaching Data Fundamentals: Rows and Columns

Data journalism professors say that the foundational data class is the most important because it lays down key mindsets and skills that are a prerequisite for more advanced learning. Steve Doig of ASU believes the core data syllabus should consist of negotiating for data, thinking critically about data, and using spreadsheets to analyze data.

It is difficult to overstate the value of spreadsheets for managing information. When we asked former CUNY professor Amanda Hickman, now an Open Lab senior fellow at BuzzFeed, how she defines data, she replied, “anything tabular.”

For the foundational computer-assisted reporting classes, the syllabus analysis and interviews indicate that the coursework is comprehensive, providing a strong base in critical thinking and basic concepts surrounding the use of data to find and tell stories. Students are taught similar concepts: critical thinking and developing a “data frame of mind”; in other words, being able to question data in a disciplined way, make sense of discrepancies, and find the underlying patterns and outliers that are important to the analysis.

Most of the classes include some type of hands-on learning. Many of them focus first on spreadsheets, then SQL, followed by mapping and statistical concepts. Others include basic data visualization, using Tableau or Google Fusion Tables as a way into the subject. Multiple professors said the hands-on approach reinforces the critical thinking concepts, including helping students to understand what structured data look like and how information of any kind can be structured for better understanding.
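
To illustrate the step from spreadsheets to SQL that many of these classes take, here is a minimal sketch using Python's built-in sqlite3 module. The table layout, school names, and scores are invented for illustration and do not come from any syllabus or dataset in this study. The query simply shows one way tabular data might be "interviewed."

```python
import sqlite3

# Hypothetical rows standing in for a spreadsheet of school test scores.
# Every value here is invented for illustration.
rows = [
    ("Lincoln Elementary", "North", 2014, 71.5),
    ("Lincoln Elementary", "North", 2015, 64.0),
    ("Adams Middle", "South", 2014, 58.2),
    ("Adams Middle", "South", 2015, 60.1),
    ("Grant High", "North", 2014, 80.3),
    ("Grant High", "North", 2015, 79.9),
]

# Load the rows into an in-memory database table, one column per field.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE scores "
    "(school TEXT, district TEXT, year INTEGER, pct_proficient REAL)"
)
conn.executemany("INSERT INTO scores VALUES (?, ?, ?, ?)", rows)

# Question for the data: how did each school's proficiency rate change
# from 2014 to 2015? A self-join pairs each school's two yearly rows.
query = """
SELECT a.school,
       ROUND(b.pct_proficient - a.pct_proficient, 1) AS delta
FROM scores AS a
JOIN scores AS b ON a.school = b.school
WHERE a.year = 2014 AND b.year = 2015
ORDER BY delta ASC
"""
changes = conn.execute(query).fetchall()
conn.close()

# Largest declines are listed first.
for school, delta in changes:
    print(f"{school}: {delta:+.1f} points")
```

The same question could be answered in a spreadsheet with a sort and a subtraction column. Writing it as a query makes the steps explicit and repeatable, which is one reason SQL often follows spreadsheets in the sequence described above.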

Another key feature of the 63 syllabi we reviewed was an exercise in requesting and negotiating for data from a governmental body. Dan Keating, who works at the Washington Post and teaches a long-standing class in computer-assisted reporting at the University of Maryland, said that finding what “no one has ever known before” is a defining part of his class.

Many CAR courses break down this way:

Hard Skills

- Searching for and finding documents and data that enable the journalist to make statements of fact, including public requests, deep research, and scraping skills
- Understanding data structures and how to clean and standardize data into a form that is useful
- Analyzing data using spreadsheets, databases, mapping, and visualization
- Learning advanced statistical methods that illuminate data

Guiding concepts

- Finding what “no one has known before”
- Developing data-driven storytelling techniques, including how to use numbers effectively in prose and how to tell a story visually
- Thinking of data as an asset in the reporting process
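
As a rough illustration of the "clean and standardize" skill listed above, the following Python sketch tidies a small, made-up spending export before totaling it. The sample rows, the lookup table of canonical agency names, and the helper function names are all assumptions made for this example. Real cleaning work usually needs many more rules and manual review.

```python
import csv
import io
from collections import Counter

# Hypothetical raw export: one agency spelled three ways, stray whitespace,
# and dollar amounts stored as text. All values are invented.
RAW = """agency,amount
 Dept. of Transportation ,"$1,200.50"
DEPT OF TRANSPORTATION,$300
Department of Transportation,"$99.50"
Parks & Recreation,$45
"""

# Assumed lookup table mapping normalized spellings to one canonical name.
CANONICAL = {
    "dept of transportation": "Department of Transportation",
    "department of transportation": "Department of Transportation",
}


def clean_name(raw):
    """Drop periods, collapse whitespace, lowercase, then map to a canonical name."""
    key = " ".join(raw.replace(".", "").lower().split())
    # Fall back to the whitespace-trimmed original when no mapping exists.
    return CANONICAL.get(key, " ".join(raw.split()))


def clean_amount(raw):
    """Convert a string such as '$1,200.50' to a float (adequate for a sketch)."""
    return float(raw.replace("$", "").replace(",", ""))


# Total the spending per standardized agency name.
totals = Counter()
for row in csv.DictReader(io.StringIO(RAW)):
    totals[clean_name(row["agency"])] += clean_amount(row["amount"])

for agency, total in sorted(totals.items()):
    print(f"{agency}: {total:,.2f}")
```

Without the standardization step, the three spellings of the same agency would be counted as three separate agencies, which is the kind of discrepancy a foundational class teaches students to look for.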

Whether following the guiding concepts or applying the hard skills, journalism students today must be well grounded in both the importance of data and the tools to use data in storytelling. “If you don’t deal with data as a journalist, you’re shutting yourself down,” said McGinty of the Wall Street Journal.

Teaching Advanced Data Skills: Visualization and Programming

Advanced instruction in data journalism today is limited. Only 14 of the 113 ACEJMC-accredited programs surveyed for this study teach programming beyond HTML/CSS to journalism students. And only 11 of the 113 offer coursework in emerging areas of data journalism, such as drones, virtual reality, and computational methods.

In fact, based on the analysis of syllabi and journalism programs, even some classes described as advanced primarily teach basic tenets of spreadsheet use. Part of the reason is that this is still where the need is greatest, said professors and trainers. “It is unbelievable how much time I spend teaching the basics,” said Jaimi Dowdell, the senior training director for IRE.

However, teaching the basic CAR curriculum is not enough, argued Kevin Quealy, a graphics editor at the New York Times and adjunct professor of journalism at New York University. “To do data work at a high level, one or two semesters of courses is very inadequate,” he said.

Many journalism programs offer design classes, but often those classes focus on basic design tenets, overall web design, or static infographics. Teaching students the concepts and skills needed to visualize data in an interactive way or to build a web application is rarer.

Not all data journalism educators are convinced that data visualization for news presentation should even be considered part of a data journalism curriculum. However, most agree that it is vital to teach visualization for the purpose of analysis. Alberto Cairo, who is leading an effort to fill a data visualization gap in his role as Knight Chair in Visual Journalism at the University of Miami, believes that even basic visualization instruction goes a long way toward literacy.

First, data journalists need to know how to do basic exploratory visual analysis, Cairo said. And second, even journalists who practice data visualization need to start with the exploratory analysis. They need to know, just like the CAR specialists, how to “interview” the data, he said.

One challenge for traditional journalism schools, which may lack a strong journalism design component and may already have difficulty teaching a CAR or data analysis class, is whether they should tap professionals or recruit or train faculty to incorporate data visualization. In response, Cairo and other academics and professionals we interviewed suggest that such schools collaborate with other parts of a university to fill the gap.

For our analysis, we differentiated between web and digital technologies aimed at presentation and the data skills needed to tell a story. This can be a difficult boundary line. News applications, for example, are focused on design, but, based on our interviews, there is a key difference between building a new website or a multimedia presentation and building something like ProPublica’s “Dollars for Docs,” which enabled readers to drill into the story of pharmaceutical industry payments to doctors and also made it possible for other journalists to find and tell other stories. Meanwhile, “Snow Fall,” the New York Times’s much-touted (and Pulitzer Prize-winning) interactive story of skiers caught in a Washington state avalanche, wasn’t about data and it wasn’t about furthering the use of the data; it used design skills to make the story an immersive multimedia experience for the reader.

Just 14 journalism schools in our data set teach programming beyond HTML and CSS, based on their course descriptions. At present, the programming languages most often used in classes on data-driven reporting are SQL, Python, and R. Instructors focusing on data analysis often incorporate SQL, and some will introduce R. Some instructors also teach web frameworks, such as Django and Ruby on Rails, and some visualization professors teach JavaScript and other skills, though fewer go into the D3 library developed by Mike Bostock, a former New York Times graphics editor.

Deen Freelon, a communication studies professor at American University, takes a different approach, teaching “code for the purposes of analysis” in a course open to both communications and journalism students. “I just got back from my last class where I was teaching students how to analyze Twitter data,” he said.

While advanced classes are rare, there is a clear demand for this knowledge. In the tech world, short programs designed to train web developers have emerged as financially viable businesses. These code schools have shown that some of these skills can be taught in considerably less time than a four-year degree. The Lede Program at Columbia, which offers a summer boot camp as well as an intensive two-semester certification program in computational skills for journalists, has drawn students interested in gaining key data skills in a short period of time.

Maggie Mulvihill, a clinical professor of journalism at Boston University, is raising revenue for computational journalism efforts there by holding week-long camps on storytelling with data for non-journalism professionals.

Integrating data journalism exposes students to the field, highlighting this as an area that they might choose to practice, but it is also an important step for students developing a foundation of journalistic skills. As noted, 69 of the 113 AEJMC-accredited programs already integrate some data journalism into reporting and writing courses, and on this front there is some good news: several schools expressed interest in adding data journalism in a systematic way to their programs. When, in order to verify our data, we contacted each of the programs that had listed either no data journalism class or just one, 11 responded that they are actively working to add data journalism to their curricula. At the University of Alabama in Tuscaloosa, for example, the school does not offer a standalone CAR class, but it now includes components of data analysis instruction in three separate journalism classes.

Alternative Data Instruction: The State of Online Courses

One response to the widespread lack of instruction in data journalism, and of instructors capable of teaching it, has been to enlist respected teachers for massive open online courses, or MOOCs. Doig is one of those teachers, and he suggests MOOCs offer great benefit for certain classes, providing expert instruction and hands-on training.

He was an instructor in two MOOCs focused on data journalism, one organized by the European Journalism Centre, which drew 25,000 enrollees, and the other by Rosental Alves of the Knight Center for Journalism in the Americas at the University of Texas School of Journalism, which drew more than 4,000.

“One strength is that there are a wide variety of MOOCs out there created by top faculty at major institutions like Harvard and MIT and Stanford,” Doig wrote in a memo on the subject. “Their existence begs the question of why should your institution go to the trouble of creating and staffing a class that covers the same ground. (Of course, one reason would be to collect the tuition from your students!)”

He suggested that a partial journalism curriculum could be crafted out of MOOC offerings combined with video content from journalism-related sources such as IRE and the Poynter Institute’s News University. However, being unable to provide individual feedback, MOOCs would come up short for classes in news writing or basic reporting, he said.

Our research assistant participated in three MOOCs to help us develop a sense of how well the virtual courses teach data journalism. He found that MOOCs are best at offering introductory exposure, but one should not expect to reach in-depth knowledge. MOOCs may be useful for developing an initial foundation in a subject, or for reinforcing a fading proficiency, but may be lacking in terms of teaching reporting techniques, critical thinking, or creative skills. The three MOOCs he participated in were effective at teaching tools, and our RA reported that he was often excited to learn a feature or technique within an application. However, finding ways to apply these tools outside of exercises may require person-to-person interaction in a classroom setting.

In order for MOOCs to be viable resources, they must be maintained. Many classes referenced lost and outdated information. Broken links, missing materials, and redesigned websites often made it difficult to navigate through the lessons.

Our participant’s experience pointed to the issues raised by Doig, but the ASU professor does think MOOCs could still be an optional resource for a data journalism course. “Students eager to go beyond what is offered in the classroom (alas, almost certainly a minority) can be pointed to online sources that will give them that content,” Doig wrote. “To that end, it might be a good idea to develop a list of MOOCs and such that journalism instructors could sample and offer to their students.”

Textbooks: Little Consensus

Our analysis also found one more gap in curricula: the absence of a strong core of textbooks. The concepts and skills of this field were described in fairly consistent ways throughout our interviews and the text of the syllabi, but data journalism instructors share only a few core books in common. In fact, most didn’t use a textbook at all but provided a list of selected readings.

Across the syllabi, more than 70 different textbooks were required, but there was no consensus on which books were preferred. The most popular book, Brant Houston’s Computer-Assisted Reporting: A Practical Guide, was required in just 14 percent of the classes.

Five courses required membership in IRE, and 23 of the courses required students to buy a book published through IRE. These books included various editions of Houston’s and IRE’s The Investigative Reporter’s Handbook: A Guide to Documents, Databases, and Techniques and Sarah Cohen’s Numbers in the Newsroom: Using Math and Statistics in News. Various editions of Philip Meyer’s book, New Precision Journalism or Precision Journalism, were required in nine of the courses.

Eight of the courses required The Data Journalism Handbook, which was produced as a combined effort by data journalists around the globe. The online book is an initiative of the European Journalism Centre and the Open Knowledge Foundation and is available free on the Web in English, Russian, Spanish, French, and Georgian.

In 17 classes, no text was required. The lesson here may be that online reading works best for these classes. But it also could mean that despite its long history, data journalism is still a nascent subject within journalism schools and there may be a dearth of effective textbooks beyond the few that are commonly assigned.

Chapter 3: Qualitative Findings: Interviews and Observations

Identifying What to Teach

“Data journalism isn’t easy to define or to teach. It is constantly changing and best practices are evolving. One needs to learn a lot by doing, too.” – Jonah Newman of the Chicago Reporter

Our interviews echoed many of the findings in our quantitative data, so rather than repeat those findings, this chapter focuses on how the professors put the concepts into practice in the classroom. It is intended to provide a road map of existing pedagogical work in data journalism and offer insights into common challenges.

We interviewed nearly 50 teachers and practitioners, and while there is a diversity of thought, there also is a consensus when it comes to the foundations of data journalism curricula: critical thinking, mastery of key data skills, and teaching programming concepts so that students will be able to learn new tools as needed.

David Boardman, dean of Temple University’s School of Media and Communication, suggested that data journalism is about learning higher and more complex levels of analysis. This includes learning more sophisticated tools and software and almost certainly some level of programming.

In a data journalism class, having that critical thinking skill means that the students learn to treat data in an ethical way: rather than bending the data to support a particular view, they aim for truth and accuracy.

“I would always err on the [teaching of] critical thinking skills,” said LaFleur of the Center for Investigative Reporting. “That is the harder skill to ingrain in people. You can learn how to click things and write a line of code.”

In general, those who teach data journalism focus on hands-on methods. In the beginning, the professors will provide data to students to analyze. LaFleur, for example, uses hands-on training with one data set and then will introduce a similar data set and assign the students to ask the same types of questions, but on their own.

By the middle of a course, students often have to obtain their own data, submitting public records requests. The students then move on to data that require more complex analysis. By doing this, the professors are doing two key things: teaching the critical thinking that goes with negotiating for information and understanding the bounds of that information. At the same time, the students are using basic tools to accomplish their goals, be they spreadsheets or a relational database. In some classes, the focus is on writing a memo by the end of the course on a possible story. In other courses, the professors expect the students to report and write a story. This last step, either a memo or a story, once again helps the student use critical thinking, this time pairing that with the skills of storytelling.

The key is learning how to obtain mastery, said Ira Chinoy, an associate professor at the University of Maryland who previously led the data journalism efforts at the Washington Post.

Chinoy relates this to the 2009 “Miracle on the Hudson” and how the pilot used reflexive mastery to land the US Airways plane on the river after bird strikes caused both engines to fail. In class, when students get discouraged about bad interactions or conversations in pursuit of their databases, Chinoy brings up “Sully” Sullenberger’s actions and says, “Do you think he could have done that on his first day of pilot school?”

Chinoy emphasized that the information should not always be presented to the students up front. He likes to give them a chance to come up against obstacles. They also need to develop a sense of when data could be problematic, what the signs of that are, and what each student’s best practice is for examining the data.

The Coding Issue

Whether data journalists need to program remains an active debate. But when we delved into this issue, we found that we first need to define what we mean by these terms in data journalism. To some, “code” means web development and design, back to the concept of HTML and CSS. “Programming” means writing programs that enable advanced mining of data or algorithms that could identify patterns.

The bottom line is that to do more advanced data journalism, its practitioners need, at a minimum, to understand how programming works. This could be considered the start of computational thinking. Just scraping information from the Web can involve simple programming using Python, and understanding what is possible with programmatic solutions is critical for journalists looking at websites and other troves of information, much of which is not just in rows and columns.
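To make the scraping point concrete, here is a minimal sketch of what "simple programming using Python" might look like. It uses only the standard library and parses an HTML table held in a string rather than fetching a live page. The page fragment, the names in it, and the helper scrape_rows are all hypothetical and chosen for illustration; they do not come from the report or from any course it describes.

```python
from html.parser import HTMLParser


class TableScraper(HTMLParser):
    """Collect the text of every table cell, grouped by row."""

    def __init__(self):
        super().__init__()
        self.rows = []          # finished rows, each a list of cell strings
        self._row = None        # the row currently being read, if any
        self._in_cell = False   # True while inside a <td> or <th>

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._in_cell = True
            self._row.append("")

    def handle_data(self, data):
        # Only keep text that appears inside a table cell.
        if self._in_cell:
            self._row[-1] += data.strip()

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._in_cell = False
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


# Hypothetical fragment of a web page; a real scraper would download this.
SAMPLE_HTML = """
<table>
  <tr><th>Inspector</th><th>Bridges inspected</th></tr>
  <tr><td>A. Rivera</td><td>12</td></tr>
  <tr><td>B. Chen</td><td>7</td></tr>
</table>
"""


def scrape_rows(html):
    """Return the table in `html` as a list of rows (lists of strings)."""
    parser = TableScraper()
    parser.feed(html)
    return parser.rows


for row in scrape_rows(SAMPLE_HTML):
    print(row)
```

The output is a list of rows that could be pasted into a spreadsheet, which is the sense in which scraping turns a web page into rows and columns.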

As students develop the ability to recognize computational solutions to some of these problems, some of them may then learn how to program. But even those who don’t take the coding path should still be able to understand how solutions like these can be a part of their journalistic practice. The ability to work with data and think in terms of computation is a skill broader and more necessary than any specific tool or programming language. It is vital that we don’t confuse the two.

Mark Hansen, a professor of journalism at Columbia and director of its Brown Institute for Media Innovation, also focuses on teaching both programming skills and the mindset of computational journalism. The idea, according to Hansen, is that by using programming, journalists can think beyond rows and columns as they search for answers in data of all forms, whether structured or unstructured.

Nicholas Diakopoulos, a computer scientist at the University of Maryland, has been teaching a number of classes in data journalism beyond the introductory level. He also offers a course on coding in the sophomore year. His aim, he said, is to move the undergraduate students from the track of learning CSS/HTML basic web skills to understanding web development and news apps. Beyond that, he’s offering a class on computational journalism with a focus on Python, text analysis and aggregation, recommender systems, and writing stories with code behind them.

Diakopoulos suggested that students could also take computer science classes if they want to learn how to be hackers, meaning, in the original sense of the word, anyone fluent enough with computers to use them creatively.

It’s all about working with data in a principled way, Diakopoulos said. He ties this to CAR and Philip Meyer’s crusade to bring the scientific way of thinking into journalism 50 years earlier; in other words, thinking methodically, thinking about how to frame an experiment, gather data, and use rigorous methods to build evidence of some finding of journalistic importance.

In the end, data journalism is about teaching how to find the story, using an increasing array of data techniques, said David Donald, data journalist in residence at the School of Communication at American University and data director of AU’s Investigative Reporting Workshop. “You’re still talking about story and how data needs to be vetted and be expressed in a way that gets into the public’s brain easily,” Donald said. “From the investigative side, you are looking for evidence in the data.”

Developing that computational ability will become even more important to handle the vast amounts of data in today’s world. More tools will come and go, but data journalism, at its core, will enable journalists to do their job in a more expansive way, said Coll of Columbia.

“I think that [data journalism] will be around for a while,” Coll said. “It will be around for a whole set of iterations of platforms and distribution systems, and even media. So we get virtual reality, or we get 3D, or we don’t. That’s a whole different set of questions. This is going to be about how you report on government, how you report on corporations, how you tell wheat from chaff.”

Data journalists are starting to address this type of coverage, but it takes a deeper level of data journalism capability: the computational journalism slice of data journalism. Some of it involves presenting data in new, journalistic ways. The “Surgeon Scorecard,” published by ProPublica, is one example. ProPublica used extensive Medicare data and collaborated with leaders in the field to evaluate the performance of surgeons.

In other iterations, this level of computational journalism means examining information in new, more complex ways. Examples of that type of data journalism are the Wall Street Journal’s coverage of the Medicare system, which received the 2015 Pulitzer Prize for Investigative Reporting, and Reuters’ 2014 project examining influence in the Supreme Court.

Institutional Challenges: Resources

Depending on the university, some students need more support with technology. Some students still do not own personal laptops and rely on school computer labs for their assignments, for example. Other students may be using personal computing devices that are not equipped with what they need to do data journalism. Students who use a tablet (such as an iPad) as their primary tool will face barriers.

Meredith Broussard, who taught data journalism at Temple University until 2015, said that ensuring that her students had the equipment they needed for her class was a major priority. Many of her students relied on a tablet, which meant equipping computer labs with the necessary equipment and platforms, or even lending laptops to students for the term.

Brant Houston of the University of Illinois Urbana-Champaign also pointed to the availability of resources as an important issue, especially for universities that draw students from economically disadvantaged populations.

Journalism schools can help these students by investing in up-to-date lab equipment and by working to create an environment that makes it easy for students to access needed software and to install it on their own devices. Journalism school administrators should consider more frequent audits and surveys of professors to identify which software will be most useful for their students.

And for students who are working on their own personal laptops, some professors hold provisioning sessions to help students install the needed software at the beginning of the term.

Institutional Challenges: Faculty Expertise

It is no secret that a divide exists between the professional journalism world and the academic world. This chasm persists even among faculty when it comes to who teaches data journalism and the impact it will have on the department.

Of course, each brand of data journalism instructor may have his or her own biases. Those who started as professional journalists, or who still work in a newsroom and teach as an adjunct, believe that they can convey the critical thinking skills needed to succeed in a newsroom environment more effectively than a professor whose experience is in research.

On the other hand, Diakopoulos of the University of Maryland believes that faculty should hold PhDs, and that while it would be good to be able to hire someone with 25 years of experience in data journalism, it’s an unrealistic expectation at this stage of the field’s development. His goal, he said, is to teach thinking through research. Still, he admitted that this is a struggle.

Some data journalists and journalism professors take issue with Diakopoulos, suggesting that such a model of data journalism professors with PhDs is unrealistic in a world where data journalism emerged from the professional practice, not academia.

Wherever journalism schools find the necessary faculty, just hiring a new professor to specialize in data journalism will not solve the problem, said Doig from ASU. “One difficulty with having somebody like me is everybody else can say, ‘Ah, we don’t have to worry about data journalism now.’ In reality, I teach maybe two sections of 20 students each semester. That’s a fraction of our total student load,” Doig said. “So believing that it is somehow being taken care of by one specialist like that, that isn’t the case.”

To help solve at least some of the issues, Doig has provided short video tutorials to other professors for basic government reporting classes.

While Doig believes it would be good to have a required data journalism course, he also questions whether that is possible. “How are you going to find the faculty to teach that?” he asked. “There’s not enough people in town who could teach that.”

Professional-track and academic-track faculty members agree that, for now, pulling in professional journalists to serve as adjuncts will continue to be necessary, but that relying on professional journalists alone will not solve the problem.

For Dustin Harp at the University of Texas at Arlington, this conundrum was solved through her own initiative. She had never taught data journalism but decided the students needed the class, so she did some research and created one. Some colleagues asked her why. She has tenure and no one asked her to take on the extra work. But the students needed the class, Harp said. She used lynda.com for tutorials and learned the material herself before teaching it to her students.

“The thing is I’m a qualitative researcher, I’m not a numbers person, I’m not a numbers cruncher, so it was very crazy and daunting. After I said I was going to do it, it was on the schedule, I was like what have I gotten myself into?” Harp said. “But I follow the field. ... I’m aware that data journalism is, it’s a tool our students need to be more competitive to get jobs.”

Institutional Challenges: Student Engagement

Journalism programs need to do a better job of persuading or even requiring students to take a data journalism class. Students may shy away because they believe they aren’t any good at math. “A lot of students are scared of ‘that math thing,’” said one journalism student at Northwestern University.

Resistance to math is an issue far broader than the field of journalism, but it will need to be addressed if teaching data journalism is to be taken seriously. This applies to both teachers and students, some of whom may have chosen to pursue journalism in part because they thought that it would require little or no math.

The problems go deeper than just convincing people they can handle math. Even in universities with entire programs focused on teaching programming, data journalism, and even data visualization, some students have reported that it wasn’t easy to find out about these opportunities. Some of the reasons have to do with silos within schools and departments for specific programs as well as specific tracks with emphasis on specific types of journalistic practice.

Rich Gordon, co-founder of the Knight Lab and director of digital innovation at the Medill School of Journalism at Northwestern University, agrees that a gap exists between the basic CAR course and the much more advanced program through Knight Lab, which brings in technologists and works with them to develop new applications for journalism. Journalism students going through a normal degree plan may have the opportunity to take a basic CAR class, but most won’t ever be exposed to the work at the lab, he said.

In general, data journalism courses are electives and draw only a few students out of the total enrolled in each journalism program. Some of that has to do with capacity, but another issue is the lack of visibility. Often, other professors don’t treat the classes as vital to a journalism career.

“There’s some student interest in CAR,” said one University of Missouri journalism graduate. “But there would be more if it were expressed as an option for students early on.”

Universities can address this issue, said Mike Reilley, professor of practice at Arizona State University, who regards universities as “too siloed.” Reilley advocates team-taught courses and cooperation between departments.

Some students work their way through to find what they need. For instance, one student who took Temple University’s undergraduate class in data journalism had taken classes in programming with Python through the university’s business school. She told our researchers she planned on learning more data journalism as she wanted to continue doing this type of journalism when she graduated. But institutional changes could make data journalism much more accessible.

Several students suggested that schools should offer a track that could include a journalistically focused statistics class, a class that focuses on databases, a class with a focus on reporting with data, and others that delve into more in-depth data reporting and data visualization.

One of the authors of this report co-taught a spring 2015 watchdog reporting class with an engineering professor. Five computer science students were embedded in project teams of journalism students. The journalism students learned new data skills, and the computer science students learned techniques in reporting and writing. The class offered challenges, too. Next time around, there may be a more defined way for the journalism students to take on data challenges of their own and continued emphasis on having the computer science students learn skills such as interviewing.

Chapter 4: Model Curricula in Data and Computation

Introduction and Summary of Curricular Recommendations

The preceding chapters offer a picture of the state of education in data and computational journalism in the United States, as well as an argument for the necessity and even urgency of journalism schools committing to teach these subjects. What follows in this chapter are model curricula and guidelines that we hope will facilitate this transition. We intend these models to be flexible; we hope that this information can be applied across schools despite variation in term length, academic units, and time to degree.

This chapter is divided into five sections. The first is a model for an introductory class in data and computation that we recommend as a requirement for all journalism students. The second section offers ways of integrating data and computational instruction into core classes and certain electives. The last three sections present full model curricula for a range of degrees. The first of those three is a track or concentration in data and computational journalism. It is flexible in order to be generally applicable at the undergraduate or graduate level. The other two are models for advanced graduate work. The first of these presumes a student with some reporting experience or a journalism degree in hand who wishes to develop expertise-driven reporting skills, that is, to write about complicated subjects from a position of deep understanding. The final model is for a research-driven, lab-based graduate degree in emerging media and technological innovation.

Above all, we recommend that all programs have a required foundational course in data journalism, teaching basic principles of data analysis for the purpose of finding stories while cultivating a sense of the general techniques and possibilities of data-driven reporting. The premise of this course is that all reporters must be prepared to use data in their work and to recognize when this approach is needed.

Considering that many journalism programs are designed to cover a dizzying range of material in a short period of time, some of our readers might be wondering where to find room for a required class in data journalism. Schools that have an existing class on basic numeracy for journalists could rework the class to include a greater emphasis on data-driven reporting methods. Another opening might lie in multimedia classes, many of which were introduced only in the past decade. Compared with data journalism, which frames a mindset for gathering and presenting information, multimedia instruction often centers on teaching tools with uncertain shelf lives. It might be time to consider retiring the audio slideshow from required coursework to make room for data skills.

Model 1: Integrating Data as a Core Class

Foundations of Data Journalism

This is a model for a required introductory course at the graduate or undergraduate level. What follows is a narrative account of how such a class may proceed in developing data literacy among beginning journalists. We realize that this course may need to fit the unique contours of different journalism programs, some of which contain boot camps and other introductory programs with idiosyncratic durations and varying levels of intensity and focus on different skills. The point is for this course to be given equal footing with other skills or subject matters that are currently treated as essential in a journalism education.

Course Description: This course is an introduction to the collection, analysis, presentation, and critique of structured information by journalists. As students are introduced to the basics of reporting and the range of journalistic methods that they may pursue in later coursework, an introduction to data and computation is an essential component of their journalism education.

Over the course of a term, students should begin to develop a frame of mind in which they approach every story looking for data possibilities. They should understand how to apply basic methods in spreadsheets and relational databases. They should get a primer on using and understanding statistical concepts. They should learn how to take their data findings and locate the people who illustrate those findings for their stories. They will learn how to convert their data analysis into a pitch for a journalistic story.

Students will learn how to find data online, how to maintain personal records as they report stories, and how to use simple visualization methods to find new information: how keeping a timeline can help reveal discrepancies and how cross-checking sources of information may lead to new avenues of inquiry. The use of data in these contexts will benefit students no matter which area of journalism they choose to practice. Just like interviewing, which is a ubiquitous journalistic skill, the art of gathering and understanding data should extend widely across the field of journalism.

The trouble with data is that it so often appears clinical or detached from the richness of people’s lives. To reduce things to abstractions may seem limiting to some students. Early exercises may help to counter this presupposition. If you ask the class to gather information about each other such as their birth dates, blood types, eye color, and birthplace, they may see within a 20-minute exercise how interesting data can be when we learn something from data that we care to know.

From this point, the class may move to more journalistic exercises with spreadsheets. As students become more comfortable with spreadsheets, the class may turn to methods of data analysis such as pivot tables and basic plotting.
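For instructors who want to show what a pivot table does under the hood, the following is a minimal sketch in Python using only the standard library. The payroll rows, the field names, and the helper pivot_sum are hypothetical and invented for illustration; a spreadsheet pivot table performs the same grouping and summing through its interface.

```python
from collections import defaultdict

# Hypothetical payroll records, one row per payment line.
PAYROLL = [
    {"department": "Police", "pay_type": "regular", "amount": 52000},
    {"department": "Police", "pay_type": "overtime", "amount": 18000},
    {"department": "Parks", "pay_type": "regular", "amount": 41000},
    {"department": "Parks", "pay_type": "overtime", "amount": 1500},
    {"department": "Police", "pay_type": "overtime", "amount": 9000},
]


def pivot_sum(rows, row_key, col_key, value_key):
    """Sum value_key for every (row_key, col_key) pair, like a pivot table."""
    table = defaultdict(lambda: defaultdict(float))
    for record in rows:
        table[record[row_key]][record[col_key]] += record[value_key]
    # Convert the nested defaultdicts to plain dicts for easier reading.
    return {row: dict(cols) for row, cols in table.items()}


summary = pivot_sum(PAYROLL, "department", "pay_type", "amount")
for department, totals in summary.items():
    print(department, totals)
```

Here the rows of the pivot are departments and the columns are pay types, so a reporter could see at a glance which department accounts for most of the overtime.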

Skills: This class should prepare students to use spreadsheets and databases to find and tell stories.

Central concerns include: spreadsheet training, how to find data, clean it, look for patterns and outliers, and question the biases and omissions in how it was gathered. Instructors may choose to use an introductory data set with a good story for beginners to find (examples are listed in the appendix).

Students should also learn how to critically assess claims surrounding data. Reporting on data may go astray when it presumes this information is complete and accurate. Reporters should be trained to look for problems in data. It is necessary to question every source of information.
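One way to make "looking for problems in data" tangible is a short audit script. The sketch below, in standard-library Python, checks a small CSV for blank values, implausible negative numbers, and exact duplicate rows. The sample data, the column names, and the helper audit_rows are hypothetical; real audits would be tailored to the data set and the questions a reporter needs to ask of its source.

```python
import csv
import io
from collections import Counter

# Hypothetical payroll extract with three deliberate problems.
SAMPLE_CSV = """name,department,salary
Ana Ruiz,Parks,41000
Ben Cole,Police,
Ana Ruiz,Parks,41000
Dee Fox,Police,-5
"""


def audit_rows(csv_text):
    """Return names with blank or negative salaries and names on duplicate rows."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    # Count how many times each complete row appears.
    seen = Counter(tuple(row.values()) for row in rows)
    report = {"blank_salary": [], "negative_salary": [], "duplicated": []}
    for row in rows:
        salary = row["salary"].strip()
        if not salary:
            report["blank_salary"].append(row["name"])
        elif float(salary) < 0:
            report["negative_salary"].append(row["name"])
    # The first field of each row is the name column.
    report["duplicated"] = sorted({r[0] for r, n in seen.items() if n > 1})
    return report


print(audit_rows(SAMPLE_CSV))
```

Each flagged record is a question to take back to the agency that supplied the data, not a finding in itself.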

Another foundational aspect of data-driven reporting is to recognize patterns and anomalies in data. Two skills that should emerge from this class are to look for trends and to identify outliers. Every data point is a possible source or anecdote.
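As one illustration of identifying outliers, the sketch below applies the common interquartile-range rule of thumb (values more than 1.5 times the IQR beyond the quartiles) using Python's statistics module. The overtime figures, the employee names, and the helper find_outliers are hypothetical, and the 1.5 multiplier is a convention rather than something the report prescribes.

```python
from statistics import quantiles

# Hypothetical annual overtime pay, in dollars, by employee.
OVERTIME = {
    "Adams": 1200,
    "Baker": 900,
    "Cruz": 1500,
    "Diaz": 1100,
    "Evans": 1300,
    "Ford": 24000,
}


def find_outliers(values_by_name, k=1.5):
    """Return entries lying more than k * IQR outside the quartiles."""
    q1, _, q3 = quantiles(values_by_name.values(), n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return {
        name: value
        for name, value in values_by_name.items()
        if value < low or value > high
    }


print(find_outliers(OVERTIME))
```

An outlier flagged this way is a lead for reporting: it may be a data entry error, or it may be the person whose story illustrates the finding.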

This class will introduce data visualization, but mainly as a means to explore a data set. Using an approachable program such as Excel, Fusion Tables, or Tableau, students will learn to display data in graphic form as an inroad to asking journalistic questions. The goal is not to design a graphic for publication, but to graph for the sake of understanding the data. Students may think of this as a research method or a sketch pad for further reporting. Instruction should include discussion of the ways that different visualization methods can be misleading.

Along the way, this class can cover basic numeracy and descriptive statistics, skills that every journalist needs to know. This may include reminders about how to calculate percentage and percent change, working with units and measures, and even identifying large numbers like billions and trillions. Once those are covered, the material could move to statistics principles and methods such as standard deviation and regression analysis.
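The calculations named above can be sketched in a few lines of Python. The example below computes percent change, sample standard deviation, and a simple least-squares trend line for a made-up city budget series. The figures and the helper names percent_change and fit_line are hypothetical, and the regression is written out by hand so the arithmetic is visible rather than hidden in a library call.

```python
from statistics import mean, stdev

# Hypothetical city budget, in millions of dollars, by fiscal year.
YEARS = [2011, 2012, 2013, 2014]
BUDGET = [100, 110, 120, 130]


def percent_change(old, new):
    """Percent change from old to new: (new - old) / old, times 100."""
    return (new - old) * 100 / old


def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for y = slope * x + intercept."""
    mx, my = mean(xs), mean(ys)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum(
        (x - mx) ** 2 for x in xs
    )
    return slope, my - slope * mx


change = percent_change(BUDGET[0], BUDGET[-1])
spread = stdev(BUDGET)  # sample standard deviation
slope, intercept = fit_line(YEARS, BUDGET)

print(f"Change 2011 to 2014: {change:.1f}%")
print(f"Standard deviation: {spread:.2f} million")
print(f"Trend: {slope:.1f} million per year")
```

In this toy series the slope says the budget grows by about 10 million dollars a year, a sentence a reporter can put in a story; the intercept has no practical meaning here and is returned only for completeness.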

Topics: Data sources, importing data, negotiating for data, checking the veracity of data, data cleaning, using formulas in spreadsheets, querying databases, finding social significance in the data, writing a data story, visualizing a data story.

Course Structure: Mix of hands-on practice and lectures, primarily using spreadsheet tools and perhaps relational database software; some limited exposure to data visualization for story exploration.
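For the database-querying portion of such a course, a query might look like the sketch below, which uses Python's built-in sqlite3 module and an in-memory database so nothing needs to be installed. The table, its columns, the bridge names, and the rating threshold of 4 are hypothetical and chosen only for illustration.

```python
import sqlite3

# Build a small in-memory database of hypothetical bridge inspections.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE inspections (bridge TEXT, county TEXT, rating INTEGER)")
conn.executemany(
    "INSERT INTO inspections VALUES (?, ?, ?)",
    [
        ("Main St", "Adams", 3),
        ("River Rd", "Adams", 4),
        ("Oak Ave", "Brown", 7),
        ("Mill Ln", "Brown", 2),
        ("Elm St", "Clark", 8),
    ],
)

# Which counties have the most bridges rated 4 or lower?
rows = conn.execute(
    """
    SELECT county, COUNT(*) AS poor_bridges
    FROM inspections
    WHERE rating <= 4
    GROUP BY county
    ORDER BY poor_bridges DESC, county
    """
).fetchall()
conn.close()

for county, poor_bridges in rows:
    print(county, poor_bridges)
```

The same SELECT, WHERE, GROUP BY, and ORDER BY pattern carries over to the larger relational database software a class might use.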

Example Assignments:

Homework: Bring a piece of data journalism to critique in class.
Homework: Find a data set and explain why it’s interesting and what it might reveal.
Classwork: Discuss basic data analysis and cleaning on prepared example data.
Spreadsheet assignment: Analyze a government’s payroll, including overtime, or examine a city or county budget. This could be a bridge inspection data set, a city budget, or city payroll.
Final: Produce a data story in three assignments: pitch, draft, final submission.

Model 2: Integrating Data and Computation into Existing Courses and Concentrations

GeneralGuidelinesfortheUndergraduateandGraduateLevelsThebasicprinciplesofdatajournalismshouldbeasfamiliartostudentsaswritingalede,shootingb-roll,ortweetingupdatestoadevelopingstory.Tointegratedataskillsintojournalisminstructionmeansintroducingtheseconcernsacrossthecurriculum.

Ourcentralrecommendationisforjournalismschoolstotreatdataandcomputationascoreskillsforallstudents.Datajournalismmustbetaughtasafoundationalmethodinintroductoryclasses,adistinctthemeinmedialawandethics,areportingmethodsuitabletoanyspecializedreportingcourse,andasubjectinwhichinterestedstudentscanpursueadvancedcourseworkoraconcentration.

Moreover,becausedataandalgorithmsareincreasinglyimportanttopicstounderstandinordertoreportonissuesinbusiness,politics,technology,andhealth,amongothers,subjectareareportingclassesshouldincludematerialthatpreparesstudentstoapproachtheseinformationsourceswithproperskepticismandtoexplainthemclearlyinwriting.Inthemodelsthatfollow,wepointtoafewwaysthatdatajournalismcanbeintegratedintocoursesthatarecommonlyofferedinjournalismschools.

Onenotabledifferencebetweengraduateandundergraduateprogramsisthatamaster’sprogramoftenbeginswithabootcampinwhichstudentsarequicklybroughtuptospeedonawiderangeofskills.Forthemajorityofstudents,whoenterwithoutadeclaredconcentration,abootcampmaypointtowardareasofunexpectedinterest.Tointegratedataandcomputationaljournalismintograduateprograms,itmustbegivenequalfootingalongsideotherareaswherestudentsmaychoosetospecialize.Anintroductorymoduleondatajournalismwillbenefitstudentsasmuchaslearningthebasicsofphotojournalism.Moreover,thematicelectivecourseworksuchasenvironmentalandpoliticalreportingshouldintegratedatainstructiontothesamedegreethatitwouldemphasizesuchdistinctapproachesasphotojournalism,broadcast,andlong-formjournalism.

Introductory journalism classes are necessarily broad. Some classes are thematic, covering material from the basic history and general practices of journalism to the range of technologies and reporting techniques that constitute the modern media. Others focus entirely on the practice of journalism. Either way, data and computation must have a place in foundational courses.

At the undergraduate level, this should apply to students pursuing either a major or a minor in journalism. Coursework toward the minor also should integrate some measure of data and computational instruction.

Schools may also consider working more coursework in data and computation into other programs and concentrations. Students focused on investigative reporting, for instance, would benefit from additional coursework on finding stories in data, perhaps even as an additional requirement.

Introductory and Required Journalism Classes Integrating Data and Computation

Basic Graphics, Video, and Multimedia

How and why to integrate data: Different schools may teach a variety of visual tools under the heading of graphic, video, multimedia, or digital media. There are productive ways for data and computation to be integrated into these lessons, however the classes are structured. Data visualization would dovetail with instruction in other graphical storytelling methods such as design and video, for instance, while a general familiarity with news apps could be developed in multimedia classes.

Skills to integrate: Simple tools for building charts, maps, and timelines. Include building maps and basic data charts, visualizations and timelines, plus an overview on news apps.

Possible assignments:

Use simple tools (Google Fusion Tables, CartoDB, or Esri’s Story Maps) to locate the availability of a public service across a geographic area.
Use simple online charting tools to illustrate changes in the annual budgets of several government offices.
Include data visualization within a video to provide context and enhance the story.

Media Law and Ethics

How and why to integrate data: Legal considerations form one of the core concerns of data journalists: making public records requests can be one of the most fruitful avenues for reporting, but also one of the most frustrating. Journalism students should learn the relevant public records laws at the state and federal levels.

These courses should also address the common misconception that data is sterile, objective, or in some sense detached from human experience. On the contrary, all data exists because someone has chosen to gather it, and the use of data has social and ethical consequences.

Courses should include material on the verification of photos (through metadata or crowdsourcing) and ethical considerations surrounding leaked or sensitive data, as well as source protection and digital security in conditions of pervasive surveillance.

Skills to integrate: Becoming familiar with a range of ethical questions surrounding the use of data. Scrutinizing data for bias, errors, and incompleteness.

Possible assignments:

Prepare a critical response paper on legal and ethical concerns surrounding leaked data. This could take the form of an essay or even a mock editorial responding to a sensitive story.
File a Freedom of Information Act (FOIA) or other public records request, then follow up with needed negotiations. This may be framed as preparation for a project in a subsequent term, if and when the records come through.

History of Journalism

How and why to integrate data: Understanding history is especially valuable during times of apparent change. To observe the field of journalism evolving over the centuries can make journalism students more conscious participants in the process of inventing its future. It may also help to temper the widespread view that journalism is witnessing unprecedented upheaval due to technology. Looking back, we see that institutions come and go, new technologies are often disruptive before settling into routine, and the mission and practice of the profession are perennially under revision. Data and computation are in many ways emblematic of our time, but not exclusive to it. These topics have a long history in journalism. This class needs to tell that story.

Two distinct strands of historical concern should be covered. One is to recount the historical uses of data in the news. For example, a striking and memorable early case of data-driven journalism dates to the antebellum period in the United States, when Harriet Beecher Stowe compiled the accounts of several escaped slaves, aggregated advertisements from Southern newspapers offering rewards for their return, and published several tables of data as a rebuttal to claims that her novel Uncle Tom’s Cabin had exaggerated the reality of slavery. Likewise, one might point to Philip Meyer’s use of data to undermine racial stereotypes in the coverage of the 1967 Detroit riots. These two cases highlight the enduring value of data for asserting truths that might otherwise be denied. More broadly, where these stories place data journalism in historical context, it will not only form a canon to orient students in this area of practice, but it will also reveal that data journalism, for all its glamorous novelty, is rooted in a tradition of quality work.

Skills to integrate: Acquiring a sense of how the journalistic profession has developed over time, especially in terms of how journalists have chosen to depict the world to their audiences. Appreciating how data and computational journalism fit into historical context.

Possible assignments:

Homework: Find and analyze a chart, graph, map, or other data visualization published in a newspaper at least 50 years ago.
Term paper: Consider a contemporary concern surrounding emerging technology, such as algorithmic transparency or the Snowden leaks, in the context of other historical cases.

Advanced Classes and Electives: Integrating Data and Computation

Investigative Reporting

How and why to integrate data: Many of the tools and methods of computational and data-driven journalism were developed through investigative reporting. Fluency with spreadsheets, databases, and other mainstays of computer-assisted reporting will enable students to conduct deep investigations with the full range of resources at their disposal.

Skills to integrate: Compiling the backgrounds of people and organizations with the use of data. Turning documents into data. Making public records requests and negotiating for data.

Possible assignments:

Tracing shell company ownership through public records.
Examining medical device reports for problems in devices sold by specific companies.

Narrative Reporting and Feature Writing

How and why to integrate data: Great feature writing is built on facts and compelling narratives. This course should incorporate some data-driven and computer-assisted reporting methods, teaching students to frame, explain, and give context to data that will help to tell their story. This class should highlight that words and numbers are both sources of data. The instructor may consider inviting a guest lecture from a professor in the digital humanities to highlight novel approaches developed in this field for understanding literature and the arts through a computational lens.

Skills to integrate: General grasp of using numbers to support a narrative. Using spreadsheets to organize chronologies of the main characters in the course of reporting. Using large-scale textual analysis tools to organize, index, and annotate documents.

Possible assignments:

Use Overview or DocumentCloud to explore a large cache of documents, such as the Congressional Record, Wikipedia, or a recent leak.
Organize reporting for a long-form narrative piece by placing sources, quotes, and chronologies in a spreadsheet.
Analyze tax return (IRS Form 990) data on arts nonprofits to evaluate their finances.

Social Media Skills

How and why to integrate data: The use of social media by contemporary news organizations goes hand in hand with the use of analytics to drive traffic. If students are taught to run social media feeds, they also should be taught to understand the analytics for these platforms. Moreover, the ability to mine the social web to interpret social trends and public opinion will be an asset in reporting.

Skills to integrate: Gathering and interpreting web analytics. Scraping or otherwise aggregating social media content for analysis and use in a story.

Possible assignments:

Use Twitter analytics to determine the rate of growth in followers, retweeting activity, or the most popular stories, sections, writers, and days of the week.
Use Google Analytics to aggregate several streams of traffic data and generate more complicated (second-order) insights.
Use Google Trends to do a story on patterns in search data.
Analyze social media data to produce a chart of attention around a recent news event.
(Advanced) Use scraped Twitter data to tell a story (perhaps through sentiment analysis).
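The assignment on charting attention around a news event usually begins by counting posts per day. The sketch below shows one way that counting step might look in Python. The timestamps are invented, the function name `posts_per_day` is hypothetical, and the sketch assumes the posts have already been collected and carry ISO 8601 timestamps; it does not call any platform's API.

```python
from collections import Counter
from datetime import datetime

# Hypothetical timestamps of posts mentioning a news event (not real data).
posts = [
    "2016-03-01T09:15:00",
    "2016-03-01T17:40:00",
    "2016-03-02T08:05:00",
    "2016-03-02T08:30:00",
    "2016-03-02T21:10:00",
    "2016-03-03T12:00:00",
]


def posts_per_day(timestamps):
    """Count how many timestamps fall on each calendar day, in date order."""
    days = (datetime.fromisoformat(ts).date().isoformat() for ts in timestamps)
    return dict(sorted(Counter(days).items()))


daily_counts = posts_per_day(posts)
for day, count in daily_counts.items():
    # A crude text bar chart; a charting tool would replace this in practice.
    print(day, "#" * count)
```

The resulting day-by-day counts can be pasted into a spreadsheet or charting tool to draw the attention curve.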

Business and Economic Reporting

How and why to integrate data: The ability to gather, analyze, and critique financial data is an essential component of business reporting. Many classes already include some instruction on reading and interpreting data. As more of this data has become generally available, while some of it has become more complicated and difficult to interpret, business reporting classes will need to adapt and offer more advanced instruction.

Skills to integrate: The ability to gather and analyze data from a variety of sources, including Bloomberg terminals and APIs (application programming interfaces) for financial information. Advanced spreadsheet analysis and financial/budget analysis training.

Possible assignments:

Spreadsheet assignment: Find a story in a company’s public financial statements.
Build a personal dashboard of APIs to track financial information for a story.
Analyze whether you can predict earnings or stock price through a factor like CEO salary.
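A reasonable first step for the CEO salary assignment is to measure how strongly the two variables move together before attempting any prediction. The sketch below computes a Pearson correlation coefficient by hand. The pay and earnings figures are invented and the function name `pearson_r` is an assumption of this example. A strong correlation would not show that pay predicts or causes earnings, which is part of what students should discuss.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length, at least 2 values each")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined when one variable never changes")
    return covariance / (spread_x * spread_y)


# Hypothetical figures for five companies (illustrative, not real data).
ceo_pay_millions = [1.2, 2.5, 3.1, 4.8, 6.0]
earnings_millions = [10, 14, 9, 20, 18]

r = pearson_r(ceo_pay_millions, earnings_millions)
print(f"Correlation between CEO pay and earnings: {r:.2f}")
```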

Digital Design and Visual Communication

How and why to integrate data: Digital design courses in journalism schools serve to introduce students to layout design, editorial graphics, and the principles of visual critique. In order to integrate data and computation, such a course should include material on data visualization and at least an introduction to the idea of news apps and web development.

Skills to integrate: Basic charts, graphs, and maps. A visual critique to know which styles of visualization are good for which kinds of data and to pinpoint cases in which visual forms can conceal or distort the data.

Possible assignments:

Find a data visualization you like, then dissect, explain, analyze, and critique it.
Find some data, identify what’s interesting about it, and visualize your findings.
Mock up (design, don’t program) a news app.

Global and International Reporting

How and why to integrate data: When journalists cover other countries, numbers will often help both them and their audience to picture these unfamiliar and often complicated matters with greater clarity. A global reporting class should teach students to find, assess, and accurately convey facts and figures about foreign countries and subjects with an international scope. On a deeper level, such a class should teach students to find stories by gathering and scrutinizing data from global sources.

Skills to integrate: How to gather, evaluate, and use data from multiple international sources. How to evaluate what data can communicate about international development patterns. How to use data to complete an investigative project focused on an international issue.

Possible assignments:

Use a dataset from a large international organization such as the UN to find a story, then learn how the organization gathered its data and discuss the limitations and biases that may result.
Find data that deepens your understanding of an international story in the news, complicates the prevailing narrative, or reveals another side of it.

Science and Environmental Reporting

How and why to integrate data: Data is a crucial component of scientific topics in the news. The ability to interpret research papers and scrutinize experimental methods will make students far better reporters on these subjects. Students should emerge from this class with an understanding of the scientific method, randomized controlled experiments, statistical significance, and other factors that they will encounter while reporting on topics in science and the environment. If possible, they should also have the opportunity to use their own data sources, such as sensors for air or water quality.

Skills to integrate: How to gather, evaluate, and use data on specialized scientific topics, and to critically assess published research.

Possible assignments:

Reading and analyzing data stories covering these topics and reverse engineering how the reporters told this story.
Drafting data analysis of key datasets and story pitch memos.
Reading a research paper, evaluating the evidence (including the statistical arguments used), and summarizing in plain language for a non-technical reader.
Setting up a sensor network to test air quality across campus (class project).

Model 3: Concentration in Data & Computation

A data journalism concentration should begin with several core, required classes before moving into a track of electives offering data journalism analysis, visualization, and online research/backgrounding.

The curriculum detailed below should provide a framework for a school to begin offering specialized coursework to students who wish to concentrate in data-driven reporting or computational journalism.

This section describes some of the courses that may form such a degree. Depending on the availability of instructors and other resources, classes like these may form either the mandatory core of a concentration in data and computation, or else a range of electives.

Please note that we would not expect any journalism school to offer all of these classes, nor only these, in its data and computational curriculum. This is just one picture of the skills and thematic exposure that could constitute a journalism degree specializing in data and computation.

Core Classes Required for Concentration in Data & Computation

Foundations of Data Journalism

This is the course outlined in the opening of this chapter (see full description on page 50) as a requirement for all journalism students. If students enter journalism school without declared concentrations, this introduction will be suitable for future data concentrators to learn the basics before proceeding to other required courses and electives. Schools may also choose to require applicants to be specifically accepted into the data concentration, in which case it may be advisable to offer a summer boot camp (see “Note on Incoming Skills, Technical Literacies, and Specialized Boot Camps,” page 74) to get students up to speed on the tools and methods they will need. In this case, data concentrators may be placed in a more advanced fall foundations course with their peers.

Introduction to Journalistic Programming

Course description: The purpose of this course is to introduce students to several foundational computer-programming skills that they will use to find and tell stories. This should be a requirement of those who concentrate in data and computation, but also open to students from other tracks.

Course structure: Meets twice weekly, first for lecture and then for an intensive workshop.

Skills: The Unix command line; basic Python programming for scraping, parsing, connecting to APIs; introduction to JavaScript for web work.

Tools: Bash utilities, Jupyter/IPython Notebook, Pandas, Matplotlib, JavaScript.

Example assignments:

Test proficiency with the command line with a quiz, or even a screencast demonstrating completion of a series of tasks using Bash alone.
Story assignment reported and submitted in Jupyter/IPython notebook.
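To give a sense of the "parsing" skill listed for this course, here is a minimal sketch that turns an HTML table into rows of text using only Python's standard library. The bridge inspection table is invented and the class name `TableRows` is hypothetical. A real scraping exercise would first download the page and would more likely use a dedicated parsing library; this sketch only illustrates the idea on a small, well-formed table.

```python
from html.parser import HTMLParser

# Hypothetical snippet of a web page (illustrative, not real data).
SAMPLE_HTML = """
<table>
  <tr><th>Bridge</th><th>Rating</th></tr>
  <tr><td>Main St.</td><td>4</td></tr>
  <tr><td>River Rd.</td><td>7</td></tr>
</table>
"""


class TableRows(HTMLParser):
    """Collect the text of every table cell, grouped by row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row being read, or None outside a row
        self._cell = None  # text pieces of the cell being read, or None outside a cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


parser = TableRows()
parser.feed(SAMPLE_HTML)
rows = parser.rows
for row in rows:
    print(row)
```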

Statistics for Journalism

Course description: The methods and principles of statistics have proven to be powerful tools in the hands of journalists. This course should be a rigorous introduction to stats work taught from within a framework of journalistic concerns. That means the course is story-based, in the sense of precision journalism and the CAR tradition.

Course structure: Weekly lectures with in-class exercises, regular homework, and a final exam.

Skills: Developing and testing hypotheses; understanding and applying the central limit theorem, normal distribution, and confidence intervals; Frequentist versus Bayesian statistics; linear regression; analysis of variance.

Tools: RStudio, Excel, MySQL, Microsoft Access, SAS, SPSS (proprietary) or PSPP (F/OSS).

Example assignments:

Analyze crime statistics, look for a trend, and try to explain its cause.
Look at the distribution of cancer cases and try to decide if there is evidence of an increase in more polluted areas.
Analyze statistical evidence for U.S. and international cases to predict whether reducing the number of guns would have an effect on gun violence.
Analyze the stats in a research paper and report them in plain language.
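Confidence intervals, one of the skills listed for this course, come up most often for journalists in poll reporting. The sketch below computes an approximate 95 percent interval for a proportion using the normal approximation. The poll numbers are invented, the function name `proportion_ci` is an assumption of this example, and the approximation is only reasonable for fairly large random samples.

```python
from math import sqrt


def proportion_ci(successes, n, z=1.96):
    """Approximate confidence interval for a proportion (normal approximation).

    z=1.96 corresponds to a 95 percent confidence level.
    Returns (estimate, lower bound, upper bound).
    """
    if n <= 0 or not 0 <= successes <= n:
        raise ValueError("need n > 0 and 0 <= successes <= n")
    p = successes / n
    margin = z * sqrt(p * (1 - p) / n)
    return p, max(0.0, p - margin), min(1.0, p + margin)


# Hypothetical poll: 520 of 1,000 respondents favor a ballot measure.
estimate, low, high = proportion_ci(520, 1000)
print(f"Support: {estimate:.1%} (95% interval {low:.1%} to {high:.1%})")
```

In this invented case the interval extends below 50 percent, so a headline declaring majority support would overstate what the sample shows.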

Distribution of Electives

For the concentration, the school may offer elective courses to fulfill requirements in two or three areas of data and computational work. We have divided these into three categories: presentation/visualization, analysis for story, and journalistic programming. As a matter of designing degree requirements, a program might choose to require at least one class from each category in addition to fulfilling overall credit requirements.

presentation & visualization

Data Visualization
Visual Journalism with Data and Computation
Advanced Data Visualization
Advanced Journalistic Mapping

analysis for story

Writing About Data
Statistical Analysis for Journalism
Advanced Computational Reporting Methods (Using CAR)

journalistic programming

Introduction to Journalistic Programming
Methods of Collecting Data and Automating Reporting
News App Development
Advanced Computational Journalism

Elective Coursework Graduate Degree with Concentration in Data & Computation

Methods of Collecting Data & Automating Reporting

Course description: This course focuses on developing expertise in gathering data, cleaning it, storing it in a database, and retrieving it with ease. It also emphasizes building automated tools to serve as data sources in reporting.

Course structure: Weekly workshop or lab-based instruction.

Skills: Web scraping, APIs, cron jobs, Bash scripting, digitizing paper documents, regular expressions, parsing text and data, fuzzy string matching, record linkage, content analysis.

Tools: Python, Beautiful Soup, Mechanize, Scrapy, Tabula, SQL, MongoDB, data formats (CSV, JSON), Tesseract (OCR), Twitter bots.

Example assignments:

Classwork: Design Google Alerts to monitor subjects of interest.
Homework: Write a program to scrape the Congressional Record for everything a particular representative has said on the floor of the House.
Homework: Write a web scraper in Python and automate it with a cron job.
Homework: Build a web app or Twitter bot to post useful information from an API.
Group project: Build a sensor network to automatically post temperature or air quality measurements online.
Final project: Gather a useful body of data, previously unavailable, and share it publicly.
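Two of the skills listed for this course, regular expressions and fuzzy string matching, often work together when the same person or company is spelled several ways in a dataset. The sketch below is one minimal way to combine them with Python's standard library. The names are invented, and the helper names `normalize` and `similarity` are assumptions of this example. Any similarity threshold used to declare a match would need to be checked by hand against real records.

```python
import re
from difflib import SequenceMatcher


def normalize(name):
    """Lowercase a name, replace anything that is not a letter or space, and collapse spaces."""
    name = name.lower()
    name = re.sub(r"[^a-z\s]", " ", name)
    return " ".join(name.split())


def similarity(a, b):
    """Similarity of two names after normalization, from 0.0 (no overlap) to 1.0 (identical)."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


# Hypothetical spellings that might appear in a records dump (not real data).
reference = "Smith, John"
candidates = ["SMITH  JOHN", "Smith, Jon", "Acme Corp"]

for candidate in candidates:
    print(f"{candidate!r}: {similarity(reference, candidate):.2f}")
```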

Visual Journalism with Data and Computation

Course description: This course covers a range of methods, media, and formats for the graphic presentation of information. Readings should introduce principles of visual design and integrate these into regular assignments. Beginning with a fairly basic program like Tableau, the class should highlight the effective and accurate presentation of information in graphic form. By the middle of the term, students should branch out into using a programming library such as D3 to design their own graphics outside the constraints of existing software.

Course structure: Weekly seminar to discuss readings, followed by hands-on workshop.

Skills: Data visualization, news apps, GIS/mapping for presentation.

Tools: Tableau, JavaScript, D3, QGIS, CartoDB.

Example assignments:

Homework: Use Tableau to find a story in a previously unexplored dataset.
Final project: Create an original visualization or interactive piece programmed by hand (presumably in D3 or even pure JavaScript if it was taught in an earlier class).

Advanced Data Analysis & Journalistic Algorithms

Course description: This course should build upon the core, required classes to bring together data and computation for finding stories and making predictions using algorithmic and computational analysis.

Course structure: Weekly lecture and workshop with regular homework and a final project.

Skills: Python for machine learning, clustering, classifying documents, standardizing and matching algorithms.

Tools: R, Python (Pandas, Matplotlib, SciPy, scikit-learn), clustering algorithms (k-means, k-nearest neighbor clustering), topic modeling algorithms (LDA or NMF).

Example assignments:

Classwork: Record linkage for data cleaning. For example, analyze Federal Election Commission data to find top donors, which requires regularization of names, best done with machine learning.
Homework: Analyze State of the Union speeches since 1790 to make a visualization of how key topics have changed over time.
Homework: Implement clustering to detect outliers in a dataset.
Final project option: Build an election or market prediction model.
Final project option: Reverse engineer a pricing, lending, or credit score algorithm.
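For the homework on clustering to detect outliers, a toy version of k-means on one-dimensional data can make the idea concrete before students move to a library such as scikit-learn. The sketch below is a simplified teaching version with caller-supplied starting centers. The response times are invented and the function name `kmeans_1d` is an assumption of this example; it is not a substitute for a tested library implementation.

```python
def kmeans_1d(values, centers, iterations=20):
    """Toy k-means for one-dimensional data with caller-supplied starting centers.

    Returns the final centers and the list of values assigned to each center.
    """
    centers = list(centers)
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        # Assignment step: put each value in the cluster of its nearest center.
        clusters = [[] for _ in centers]
        for value in values:
            nearest = min(range(len(centers)), key=lambda i: abs(value - centers[i]))
            clusters[nearest].append(value)
        # Update step: move each center to the mean of its cluster.
        new_centers = [
            sum(cluster) / len(cluster) if cluster else centers[i]
            for i, cluster in enumerate(clusters)
        ]
        if new_centers == centers:
            break  # assignments have stopped changing
        centers = new_centers
    return centers, clusters


# Hypothetical response times in minutes; one value is far from the rest.
response_times = [10, 12, 11, 13, 95, 9, 11]
final_centers, groups = kmeans_1d(response_times, centers=[0, 100])

# Treat the smallest cluster as the candidate outliers worth checking by hand.
suspects = min(groups, key=len)
print("Centers:", final_centers)
print("Possible outliers:", suspects)
```

A value flagged this way is a lead to verify, not a finding: it may be a data entry error or the start of a story.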

Advanced Data Visualization

Course description: This course would pick up from the data visualization skills developed in the core course in visual journalism. The technical aspects should be conducted entirely through programming. The most likely tools are JavaScript and D3, but others will certainly emerge. The key point is that data visualization at this level should be programmatic so that the software itself does not limit design possibilities.

Course structure: It may be designed to alternate between seminars (high-level reading, discussion, and analysis of visual communication and information design principles, focusing on how it is most effective and where it can be misleading) and lab classes (advanced practical instruction in application and coding frameworks for info design).

Skills: Designing for clarity, precision, impact.

Tools: D3, JavaScript, or another suitable programming framework.

Example assignments:

Homework: Regular data assignments in different media: static web, video, interactive.
Final project: An original analysis of unexplored data, presented in an original visualization programmed more or less from scratch, with cross-platform consistency.

Advanced Journalistic Mapping

Course description: This course should build on previous coursework in mapping to cover more advanced manipulations of data, to develop a higher degree of design sophistication, and to develop a high level of news judgment in the selection of timely, compelling, and original topics. This involves using GIS technologies, joining that spatial data with other information, using density and other spatial analysis to inform stories, not just building presentations.

Course structure: Hands-on workshop and lab.

Skills: Clustering, binning, heat maps, joining different geographic datasets.

Tools: Both GIS analysis software and presentation software, including Esri, QGIS, CartoDB, Leaflet, sensors like DustDuino.

Example assignments:

Homework: Weekly pitches and journalistic mapping assignments.
Final project: An interactive map or set of maps telling a story about a timely or unexplored subject, and/or a narrative story using findings from the mapping analysis.
Class project: Build a sensor network or otherwise amass an unexplored dataset, then work in small groups to build a package of maps to explore the data.
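Binning, one of the skills listed for this course, is the step that turns individual points into the counts behind a heat map. The sketch below groups latitude and longitude pairs into square grid cells. The coordinates are invented, the function name `bin_points` is an assumption of this example, and the fixed cell size in degrees is a simplification: GIS software such as QGIS would handle map projections, which this sketch ignores.

```python
import math
from collections import Counter


def bin_points(points, cell_size):
    """Count points per square grid cell.

    Each cell is identified by the integer grid indices of its corner,
    computed as floor(coordinate / cell_size).
    """
    counts = Counter()
    for lat, lon in points:
        cell = (math.floor(lat / cell_size), math.floor(lon / cell_size))
        counts[cell] += 1
    return counts


# Hypothetical incident locations as (latitude, longitude); not real data.
incidents = [
    (40.7128, -74.0060),
    (40.7130, -74.0055),
    (40.7306, -73.9352),
]

cell_counts = bin_points(incidents, cell_size=0.01)
for cell, count in cell_counts.most_common():
    print(cell, count)
```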

Advanced Journalistic Text Mining

Course description: Text is data. The purpose of this class is to teach journalists to gather, analyze, and present stories using large amounts of textual data. This may build on material from the course “Methods of Collecting Data and Automating Reporting.”

Course structure: Weekly lecture and lab, working toward a final project.

Skills: Web scraping, analyzing large bodies of text, sentiment analysis, topic modeling.

Tools: Overview, DocumentCloud, Natural Language Toolkit (NLTK) or Stanford NLP.

Example assignments:

Homework: Use sentiment analysis to reproduce the before and after tone change of a story.
Final project: Build a scraper to crawl a significant chunk of the Web, for example, collecting the blogosphere of a country that’s in the news and learning what people are talking about.
Final project: Gather and analyze a large body of documents, such as looking for a story in a leaked cache of documents.
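Sentiment analysis can be introduced with a very small word-list approach before students use a toolkit such as NLTK. The sketch below scores a text by counting words from two tiny lists. The word lists are invented for this example and far too small for real work, and the function name `sentiment_score` is hypothetical. Word counting also misses negation and sarcasm, which is worth discussing in class.

```python
import re

# Tiny illustrative word lists; a real lexicon would be far larger.
POSITIVE = {"good", "great", "improve", "improved", "success", "win"}
NEGATIVE = {"bad", "crisis", "fail", "failed", "loss", "scandal"}


def sentiment_score(text):
    """Return (positive words - negative words) / total words, or 0.0 for empty text."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    positive = sum(word in POSITIVE for word in words)
    negative = sum(word in NEGATIVE for word in words)
    return (positive - negative) / len(words)


# Hypothetical sentences standing in for coverage before and after an event.
before = "The plan was a great success"
after = "The scandal deepened the crisis"

print("Before:", round(sentiment_score(before), 2))
print("After:", round(sentiment_score(after), 2))
```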

Advanced Computational Journalism

Course description: This course should reflect the state of computational tools in journalistic practice while looking toward novel applications of emerging and unexplored tools. By this time, students should have already developed a strong foundation of programming and data analysis skills. This class should build on that foundation and encourage in-depth, independent projects centered on reporting stories or developing a piece of software.

Course structure: Meets twice weekly, once for lecture and once for lab.

Tools: Python, Ruby, or a similarly powerful and versatile scripting language. Additionally, physical computing tools like Arduino.

Example assignments:

Homework: Use regular expressions to mine the Congressional Record for a senator’s stated positions on a political issue over the course of his or her career.
Mock coding interview: In the common style of interviewing for tech jobs, solve a given programming problem on a whiteboard and narrate your line of thinking.
Final project: An in-depth story reported using an advanced tool, including but not limited to a journalistic algorithm, machine learning, analysis of personally gathered data, or development of a piece of software.
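The regular expression homework above might start from something like the following sketch, which pulls one speaker's remarks out of a transcript and filters them by keyword. The transcript is invented and uses a simplified one-remark-per-line layout that is an assumption of this example; real Congressional Record text is messier and would need a more forgiving pattern. The function name `remarks_by` is hypothetical.

```python
import re

# Hypothetical, simplified transcript: one remark per line (not real text).
SAMPLE = """Mr. SMITH. Mr. President, I rise to support the water bill.
Ms. JONES. I oppose the amendment on energy.
Mr. SMITH. The energy amendment deserves a vote.
"""

# A line starts with a title, then the speaker's surname in capitals, then the remark.
SPEAKER_LINE = re.compile(r"^(?:Mr|Ms|Mrs)\. ([A-Z]+)\. (.*)$", re.MULTILINE)


def remarks_by(speaker, text, keyword=None):
    """Return the remarks of one speaker, optionally only those mentioning a keyword."""
    found = []
    for name, remark in SPEAKER_LINE.findall(text):
        if name != speaker.upper():
            continue
        if keyword is None or keyword.lower() in remark.lower():
            found.append(remark)
    return found


for remark in remarks_by("Smith", SAMPLE, keyword="energy"):
    print(remark)
```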

Capstone or Thesis Project

A thesis in data journalism will arise, ideally, from work with instructors and an adviser. It could take the form of a reported story, a technical report, a piece of software, or a substantial design piece such as a map or data visualization.

A capstone project for concentrators in data and computation may take a class of students and coordinate a project using and honing the skills they have developed in their earlier coursework. Each student’s work should then be supplemented with an individual contribution such as a reported piece or data visualization.

Model 4: Advanced Graduate Degree: Expertise-Driven Reporting on Data & Computation

While most undergraduate and master’s programs are designed to offer journalistic newcomers a set of skills that can be applied to a wide range of subjects, there is also demand for mid-career journalists to return for coursework in which they can develop deep expertise to report on complicated subjects. As highly technical topics have come to permeate matters from international politics to everyday life, journalism schools may wish to offer classes that prepare students and equip mid-career journalists to report on such issues as cyberwar, data breaches, and cryptocurrency that require specialized skill when writing for a general audience. The uses of data, machine learning, and computational models may also aid these reporters in finding and telling these stories.

Data and computational journalism are ideal subjects for a mid-career degree because of the time and mentorship that could be devoted to developing this set of skills. Since it is directed at students who already know how journalism works, this degree could provide a level of depth and focus that may be difficult to reach during a standard journalism program or while working a full-time job.

Courses

Foundations of Data-Driven Journalism

(as detailed above, but adapted to the level of advanced students)

Reporting About Data

Course description: The goal of this course is to prepare students to understand and critically assess reports, studies, scholarly work, and other information sources that are based in data and technical work. This will be an essential skill as each student develops a focus as an expert reporter on a topic centered on data, computation, technology, or the experimental sciences.

Course structure: Small seminar focused on the discussion and analysis of readings and case studies, culminating in a long-form piece.

Example assignments:

Response papers: Weekly analysis and reflection on class readings.
Term paper: A substantial long-form reporting project on a topic of the student’s choosing, possibly developed as an outgrowth of a weekly response paper.

Thesis

Course description and structure: Seminar in which students develop independent reporting projects, sharing progress during class and meeting regularly with an adviser to build toward a master’s thesis.

Electives

One of the goals of a mid-career, expertise-driven degree in journalism is for students to develop a deep understanding of the field they are reporting on. To this end, this degree should offer several elective slots for taking classes in other departments that contribute directly to the subject of the thesis.

Students may also consider auditing courses with skill requirements above their level (for example, if assignments must be submitted in the C programming language, which is still the case in some traditional computer science classes).

Example electives and justification:

An earth science or geology course focused on climate data.
A digital humanities course that uses computational techniques to explore historical archives, literary works, or leaked caches of documents, to name just a few examples.
Any number of computer science courses in which students could learn the technical basis and academic concerns surrounding issues of interest in their reporting, such as computer vision or cryptography.
A course in digital security could help a journalist not only to protect sensitive sources, but also to report on such matters as public key encryption or onion routing, and to assess new developments in these fields.
A graduate course in statistical modeling, whether taken in the statistics department or in a quantitative social science such as sociology.

The point of elective courses should be to permit students to craft a coursework plan that is suitable to their own unique interests as they develop the capacity for expertise-driven reporting in some area related to data, computation, and emerging technologies.

Model 5: Advanced Graduate Degree: Emerging Journalistic Techniques and Technologies

Investigative reporting is in many ways the research and development wing of journalism. According to Brant Houston, “It’s the only place where people have had an extensive amount of time to try out new techniques.”

CAR, data journalism, and computational journalism are some of the clearest examples of this phenomenon at work. These practices have developed where reporters have had the time or inclination to work with new tools and platforms. Universities are ideally suited to cultivate this stance toward journalistic practice: not merely teaching the wisdom of the field as it exists, but developing entirely new approaches based on encounters with other disciplines and unexplored tools.

If journalism schools were to take up the mantle of encouraging work that seems to happen only under these permissive conditions (not just through grants and innovation labs, but perhaps through coursework as well), then universities could also act as R&D labs in a way that investigative reporting has in the past.

This curriculum is in many ways the least structured and most speculative one we offer. It is an open question whether these degrees should be offered at the master’s or doctoral level. One might also ask whether the degree should require any coursework or simply provide an open platform for research.

Structure and Topics

One objective for this program would be to help teach algorithms for journalism, machine learning, and artificial intelligence.

Drones and virtual reality are two platforms that are being actively explored for their journalistic potential. Matt Waite established the Drone Journalism Lab at the University of Nebraska precisely to explore the journalistic applications of these devices. Likewise, the Brown Institute at Columbia has sponsored several Magic Grants to support teams of journalists exploring the narrative potential of immersive virtual reality.

Manyotheremergingtechnologieshavebeenrecognizedfortheirjournalisticpotential.Atthetimeofourwriting,immersivevirtualrealityheadsetsseempoisedtoenterthemarkettoenthusiasticreception.Augmentedrealitypresentssimilarpossibilities:broadcastjournalists

TeachingDataandComputationalJournalism

76Model5:AdvancedGraduateDegree:EmergingJournalisticTechniquesandTechnologies

Page 77: Teaching Data and Computational Journalism

maysoonarriveinourlivingroomsasholograms.

Thepointisnottospeculateonthearrivalofthesedevices,nortopromoteinnovationforitsownsake,buttoconsidertheroleofjournalismschoolsindevelopingandshapingtheuseofnewdevices.

Setting and Gear for Emerging Technology Labs

Although many coding and design projects may require nothing more than a laptop, a variety of hardware should be on hand for students interested in experimenting with sensors and other hardware that can be used for journalistic projects.

Ideally, cheap devices and components can be provided on an honor system, and more expensive gear checked out through an equipment room. A thriving example of this model is the Interactive Telecommunications Program at New York University. The program also designates a few shelves for donating useful scrap materials, such as old electronics to be dismantled for components or sheer curiosity.

A Word on Safety

Most innovation labs will feature at least one tool or device that requires safety training. Most journalism students will arrive without having had experience handling soldering irons or electrical wiring.

Any lab that includes these devices must provide some safety infrastructure. Soldering irons should be used with some means of ventilation. Fire hazards require a nearby extinguisher. And many circumstances may require safety gloves or goggles.

When Amanda Hickman arrived at the BuzzFeed lab in San Francisco, one of her first tasks was to evaluate safety. The BuzzFeed team has purchased safety goggles and fire extinguishers, for example, because it is working with saws and soldering irons.

Tinkering equipment can be quite cheap, but any technology lab must cover some basics. These components are the bread and butter of hacker and maker circles, so they are easy to find. Because they offer such useful inroads to experimenting with technology, they are valuable for journalism schools seeking to cultivate spaces of innovation.

Most electrical prototyping starts with a solderless breadboard, a flat plastic case with an underlying grid of connections for building circuits. A simple electronic device like an air quality meter can be built from scratch by placing components like wires, resistors, knobs, buttons, and sensors across the grid. And connecting a breadboard device to a simple computer like an Arduino or a Raspberry Pi enables users to issue commands and gather data from the equipment. A starter set for such a project would generally run under $100, far less than cameras and other equipment that journalism students are often required to purchase.
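As a rough illustration of the "gather data" step, the sketch below assumes a microcontroller that prints one comma-separated reading per line (seconds since start, then a particulate value). The line format, the function names, and the sample values are all hypothetical; a real project would read these lines from a serial port rather than from a hard-coded list.

```python
from statistics import mean

# Hypothetical lines as a microcontroller might print them:
# "<seconds since start>,<particulate reading>". All values are invented.
SAMPLE_LINES = [
    "0,12.5",
    "60,14.0",
    "garbled line",  # loose wires and noisy sensors produce junk now and then
    "120,13.5",
]


def parse_reading(line):
    """Return (seconds, value) for a well-formed line, or None otherwise."""
    parts = line.strip().split(",")
    if len(parts) != 2:
        return None
    try:
        return int(parts[0]), float(parts[1])
    except ValueError:
        return None


def summarize(lines):
    """Parse every line, skip malformed ones, and report simple statistics."""
    readings = [r for r in map(parse_reading, lines) if r is not None]
    values = [value for _, value in readings]
    return {
        "count": len(values),
        "mean": mean(values) if values else None,
        "max": max(values) if values else None,
    }


summary = summarize(SAMPLE_LINES)
print(summary)
```

Skipping malformed lines instead of crashing is a deliberate choice here: hobbyist hardware tends to emit the occasional bad reading, and a reporter would usually rather log and count those than lose a whole collection run.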

Beyond the small computers used for prototyping, more substantial computers should be on hand for projects that call for them. If possible, an emerging technologies lab should have machines that allow students to gain firsthand experience working with news-bound technology such as immersive 3D cameras, VR headsets, and drones instead of relying on secondhand accounts. These skills and literacies fit into a larger constellation of technical concerns that may give rise to media innovation along unforeseen paths.

Chapter 5: Institutional Recommendations

Steps Toward Bringing Data and Computation into Your Journalism School

Faculty Development and Recruitment

For many journalism schools, integrating specialized coursework in data and computation will present something of a chicken-and-egg conundrum. There is professional demand for data journalists in part because they are relatively scarce, so while schools may wish to prepare their graduates for this emerging field, the field itself may not yet have enough teachers in its ranks.

But specialized coursework will meet only some of the need highlighted in this study: data and computation must also continue to be integrated into many classes where they are now neglected, from boot camps to capstones and theses. It is worth recalling that editors once resisted the use of photography as a journalistic tool. Journalism schools must prepare students to bring data and computation to any story that needs them.

This may require many journalism faculty to be trained to work with data and computational tools. At the very least, journalism instructors should be conscious of when a student’s work may benefit from data, even if the student must go elsewhere for targeted instruction. Scheduling guest lectures may also serve as a transitional solution.

Trainings or Modules

In 2014, NICAR introduced data journalism exercises for academics interested in teaching data journalism but in need of a little help. These promise to be useful to journalism instructors who would like to teach data.

The benefit of NICAR and other journalism training organizations could go well beyond the modules, though. New tools are developing quickly, and it is critical for faculty to continue to grow, learn, and change as the field itself develops.

In fact, NICAR filled a vacuum that existed because many academic institutions didn’t address new tools or skills. Meanwhile, personal computing tools became more powerful, digital information sources became more commonplace, and news organizations increasingly relied on digital methods of gathering and distributing news. Just as print newspapers were slow to recognize the power of data-driven reporting and the Internet, so too have journalism schools been reluctant to change. Teaching institutions must adapt or risk being unable to fulfill their goals and mission, both to their students and to the profession.

Incoming Skills, Technical Literacies, and Boot Camps

Many graduate programs in journalism readily enroll students with little to no prior experience as reporters. There is an implicit assumption that their undergraduate work will provide a foundation to begin learning to think like a reporter and produce stories in a variety of platforms.

With data and computational journalism, though, there may be more substantial gaps to bridge in terms of math skills and technical literacies. Oftentimes, students must learn to use a variety of unfamiliar software in order to even begin working with data, statistics, and programming languages. As it stands, it should be fairly straightforward to teach the average journalism student to think about data, to find stories in a spreadsheet, and even to think critically about the numbers. Reporters have always needed to see inside complicated issues and to ask tough questions in order to get the story right.

Math and tech skills may require extra time. This skill gap could be ameliorated with a summer boot camp that focuses largely on building skills, tools, and technical literacies, while deferring instruction in reporting until the regular term begins. This way, when students enter their regular master’s coursework, they will be equipped with some fluency in the data and computational tools that they will use as concentrators.

In the case that data concentrators go through an extended boot camp, it may be appropriate for their fall introduction to data journalism class to be separate from, and more advanced than, the data journalism course that is required of all students. For an example of how this coursework could be structured, we have listed the offerings of Columbia’s Lede Program in the appendix.

Technology Infrastructure

Many colleges and universities provide computer labs and studios for classes. The primary advantage is the certainty that each student will have a workstation with the necessary technical specifications and software installed. The primary disadvantage is that students may graduate without the tools they need to practice the skills they have learned.

Although a newsroom workstation could potentially be outfitted with the tools they need, students who become freelancers may be out of luck if they leave school without bringing along the tools they learned there.

Providing server space for students is a great way to begin teaching them the Unix command line and to provide resources for data-intensive projects. But several institutional concerns arise. Schools are required by law to maintain the confidentiality of student data, and so the security of student servers may become a concern. Students might instead begin by working on virtual machines using a program such as the free, cross-platform VirtualBox in order to become acquainted with running a machine from the command line.

Benefits of Distance or Online Learning

Using MOOCs in complementary fashion with data journalism courses could help professors integrate new skills into what they offer, said Doig from ASU.

In addition, distance learning and virtual classrooms may provide structure and support that MOOCs lack. Journalism schools may consider coordinating partnerships in which students cross-enroll in specialized coursework and take the class over a video stream. The student would participate in class, submit work, and receive credit like any other student. This approach could fill coursework gaps in cases where it is otherwise difficult to find an instructor.

Stanton, the founder of ForJournalism.com, offers a cautionary word: maintaining online courses is a problem for any program that produces tutorials or screencasts. Without updating, the value of the offerings diminishes quickly, Stanton said. The ForJournalism.com tutorial on building a web application with the Django framework is based on an older version of the open source software, for example.

Stanton suggests that universities create a consortium in which each participating university would take ownership of specific topics in which it has expert faculty. Each would create labs to provide the technical instruction in those areas and offer screencast tutorials on the basics. Then each school could build on that foundational learning in projects specific to its programs.

Fostering Collaboration

Journalism schools should build collaborative partnerships with other disciplines. Many professional schools, journalism included, have tended to operate as silos within universities because they draw their culture and concerns from a field of practice rather than a tradition of academic discourse. That stance must shift because journalism itself is shifting. As a result, we should recognize that journalism is not a narrow set of traditional newsroom skills, but instead encompasses whatever tools and methods have, in one way or another, been made journalistic.

Practitioners of data-driven and computational journalism have thrived by embracing interdisciplinarity in their work. Several journalism schools have begun to build bridges with computer science departments by opening research centers, co-teaching and cross-listing classes, and even developing joint degree programs. This is a promising start. Not only will journalism schools benefit from acting as leaders in interdisciplinary collaboration, but they also should be naturally suited to this role as a field situated at the intersection of many other disciplines.

Note on Specialist Faculty in Data and Computation

An integrated data journalism curriculum presents a unique challenge. In the state of the field as it has developed and exists today, data journalism is usually a lone course, or element of a course, taught by one specialist instructor. Often, the instructor is a professional journalist working as an adjunct more for the love of spreading the word than for the money. To achieve a fully integrated curriculum, the overall faculty at journalism programs would need to commit to change, and administrations would need to foster training for faculty. The change needs to be broad. There should not be a single faculty member juggling all the classes in data and computational skills, nor should guest lectures from that faculty member suffice in broadening the class to account for data. Journalism schools must commit to the idea that they cannot train information professionals to work in an increasingly complicated world of information without developing these crucial literacies. These literacies must be integrated across the board.

Appendix

Tables from our analysis

Classes Offered by Subject at ACEJMC-Accredited Journalism Programs

Data Journalism

Number of Classes        Number of Programs   Percent of Total
No class                 54                   48%
One class                27                   24%
Two classes              14                   12%
Three or more classes    18                   16%

Classes with Data Journalism as a Component

Number of Classes        Number of Programs   Percent of Total
No classes               44                   38%
One class                31                   27%
Two classes              22                   19%
Three classes            9                    8%
Four or more classes     7                    6%

Multimedia

Number of Classes        Number of Programs   Percent of Total
No classes               20                   18%
One class                31                   27%
Two classes              12                   11%
Three classes            16                   14%
Four or more classes     34                   30%

Programming Beyond HTML/CSS

Number of Classes        Number of Programs   Percent of Total
No classes               99                   88%
One class                6                    5%
Two classes              5                    4%
Three or more classes    3                    3%

Note: This analysis of programming classes is focused on those courses taught within a journalism program. It should be noted that a fair number of schools pointed to collaborations with other departments where journalism students were able to take advanced programming or computer science classes.
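The percent column in these tables can be rechecked with a few lines of arithmetic. The sketch below uses the counts from the programming table; the total of 113 programs is simply the sum of those rows, not a figure stated in this appendix, and the function name is illustrative.

```python
# Counts copied from the "Programming Beyond HTML/CSS" table.
counts = {
    "No classes": 99,
    "One class": 6,
    "Two classes": 5,
    "Three or more classes": 3,
}


def percent_shares(counts):
    """Return each row's share of the column total, rounded to a whole percent."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}


total_programs = sum(counts.values())  # derived from the rows, not reported
shares = percent_shares(counts)

for label, pct in shares.items():
    print(f"{label}: {pct}%")
```

Because each share is rounded independently, a column of whole percentages will not always add up to exactly 100, which is worth pointing out to students who try the same check on the other tables.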

Notable Stories

Below we list several examples, for reference, of stories that are emblematic of the categories we define in Chapter 1.

Data Reporting

“Drugging Our Kids,” San Jose Mercury News, 2014

“Methadone and the Politics of Pain,” The Seattle Times, 2012

Data Visualization and Interactives

ProPublica’s “Dollars for Docs,” 2010

The Washington Post’s visualization of the missing Malaysian jet, 2014

Emerging Journalistic Technologies

Drone Examples:

“Tanzania: Initiative to Stop the Poaching of Elephants,” CCTV Africa, 2014

Because of regulatory issues with the Federal Aviation Administration, the use of drones for journalism is not widespread in spite of significant interest on the part of industry and academia. Uses foreseen when regulations become more permissive include news photography and videography, scanning news locations for use in 3D models and 360-degree video applications, remotely sensed data gathering through visible images or multispectral images, mapping of areas of interest at higher temporal resolutions than currently available, and use as sensor distributors or sensor-based data gatherers.

Sensor Examples:

WNYC’s Cicada Tracker project in 2013 recruited interested listeners to use sensors to identify where cicadas would emerge.

USA Today’s “Ghost Factories” investigation in 2012 used X-ray gun sensors to scan the soil.

The Houston Chronicle’s 2005 investigative story “In Harm’s Way” used sensors to examine air quality near oil refineries and factories.

Virtual and Augmented Reality Examples:

The New York Times sent out more than a million Google Cardboard kits to subscribers in 2015 as it launched its first VR story, “The Displaced,” a piece detailing children displaced by war.

Stanford University’s Department of Communication, home to the Stanford Virtual Human Interaction Lab, has scheduled a VR class for the winter 2016 quarter as part of its curriculum for its master’s in journalism program.

Computational Journalism

Story Examples:

The 2014 Wall Street Journal investigation into Medicare

“The Echo Chamber,” a 2014 Reuters investigation into influence at the Supreme Court

Platform Example:

PDF repository DocumentCloud or Overview, developed by Jonathan Stray

Tools, Resources and Methods Discussed in the Report

The ethics of software may also shape decisions about the tools and techniques you teach. “Free” software is licensed in an effort to promote freedom of computing, in a manner analogous to freedom of speech. Free software may be copied, altered, used, and shared freely. A related form of software licensing, titled “open source,” is very similar to free software, but instead emphasizes the public availability of code.

Proprietary software may also have certain advantages. Often the interface design is more polished, support services are provided, and in some cases the programs simply run better on demanding tasks.

But the gap between free and proprietary software has become narrower in recent years, and many professionals in fact prefer to use free and open source software on more than ideological grounds. F/OSS is often more secure because it can be openly vetted by security researchers. For the same reason, particularly popular applications may have many talented and dedicated developers, as well as a support community of fellow users rather than a call center or online service.

Given the expense of proprietary software and its inevitable obsolescence, there are few advantages to using these applications in data and computation classes instead of free and open-source ones.

Guide to Common Tools for Data and Computational Journalism

The following list of common tools for data and computational journalism is quoted from the Lede Program at Columbia.

Programming Languages

C is a heavy-lifting programming language that is the language of choice for the Computer Science Department. It’s far faster than Python or JavaScript and introduces you to the nitty-gritty of computer science.

Git is something called a version control system: it’s not a programming language, but programmers use it often. Version control is a way of keeping track of the history of your code, along with providing a structure that encourages collaboration. GitHub is a popular cloud-based service that makes use of git, and we make heavy use of it during the Lede Program.

HTML isn’t technically a programming language, it’s a markup language. A HyperText Markup Language, to be exact. HTML is used to explain what different parts of web pages are to your browser, and you use it extensively when learning to scrape web pages.

JavaScript is a programming language that’s in charge of interactivity on the Web. When images wiggle or pop-ups annoy you, that’s all JavaScript. The popular interactive data visualization framework D3 is built using JavaScript.

Python is a multipurpose programming language that is at home crunching, parsing text, or building Twitter bots. We use Python extensively in the Lede.

R is a programming language that is used widely for mathematical and statistical processing.

Tools for Data and Analysis

Beautiful Soup and lxml are tools used for taking data from the Web and making it accessible to your computer.

D3 is a JavaScript library for building custom data visualizations.

IPython Notebooks are an interactive programming environment that encourages documentation, transparency, and reproducibility of work. When you’re done with your analysis, you’ll be able to put your work up for everyone to see, and check!

NLTK (Natural Language Toolkit) is a Python library built to process large amounts of text. Whether you’re analyzing congressional bills, Twitter outrages, or Shakespearean plays, NLTK has you covered.

OpenRefine (previously Google Refine) is downloadable software that helps you sort and sift dirty data, cleaning it to the point where you can start your actual analysis.

Pandas is a high-performance data analysis tool for Python.

QGIS (geographic information system) is an open-source tool used to work with geographic data, from reprojecting and combining data sets to running analyses and making visualizations.

Scikit-learn is a Python package for machine learning and data analysis. It’s the Swiss Army knife of data science: it covers classification, regression, clustering, dimensionality reduction, and so much more.

Web scraping is the process of taking information off of websites and making use of it on your computer. A lot of times documents aren’t easily available in accessible formats, and you need to scrape them in order to process and analyze them.
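To make the idea of web scraping concrete, here is a minimal sketch that pulls the cells out of an HTML table. It deliberately uses Python's standard-library html.parser rather than Beautiful Soup or lxml, so that it runs without installing anything; the HTML fragment, the class name, and the school figures are invented for illustration.

```python
from html.parser import HTMLParser

# Invented HTML fragment standing in for a downloaded web page.
PAGE = """
<table>
  <tr><th>School</th><th>Enrollment</th></tr>
  <tr><td>North High</td><td>1200</td></tr>
  <tr><td>South High</td><td>950</td></tr>
</table>
"""


class TableScraper(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by table row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._in_cell = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])          # start a new row
        elif tag in ("td", "th"):
            self._in_cell = True
            self.rows[-1].append("")      # start a new, empty cell

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._in_cell = False

    def handle_data(self, data):
        if self._in_cell:                 # ignore whitespace between tags
            self.rows[-1][-1] += data


scraper = TableScraper()
scraper.feed(PAGE)
header, *records = scraper.rows
print(header)
print(records)
```

Dedicated libraries such as the ones named in the list handle messy real-world markup far more gracefully; the point of the sketch is only to show that scraping amounts to walking the page's tags and keeping the text you care about.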

Data Formats

An API (application programming interface) is a way for computers to communicate to one another. For us, this generally means sharing data. We’ll be coding up Python scripts to talk to and request data from machines around the world, from Twitter to the U.S. government.

CSVs (comma-separated values) are the most common format for data. It’s a quick export away from Excel or Google Spreadsheets, and you’ll find yourself working from CSVs more often than any other format. Although “comma-separated” is in the name, a CSV can arguably also use tabs, pipes, or any other character as a field delimiter (although the tab-separated one can also be called a TSV).

GeoJSON and TopoJSON are specifically formatted JSON files that contain geographic data.

JSON stands for JavaScript Object Notation, and it’s a slightly more complicated format than a CSV. It can contain lists, numbers, strings, sub-items, and all sorts of complexities that are great for expressing the nuance of real-world data. Data from an API is often formatted as JSON.

SQL (Structured Query Language) is a language to talk to databases. You’ll sometimes find data sets in SQL format, ready to be imported into your database system of choice.
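The relationship among these formats can be sketched by moving the same two records through each of them using only Python's standard library. The pipe-delimited text, the table name, and the figures are invented for illustration, and SQLite stands in for whatever database system a class might actually use.

```python
import csv
import io
import json
import sqlite3

# Invented records; note the pipe delimiter instead of a comma.
CSV_TEXT = "agency|requests\nParks|12\nTransit|30\n"

# 1. CSV: DictReader accepts any single-character field delimiter.
rows = list(csv.DictReader(io.StringIO(CSV_TEXT), delimiter="|"))

# 2. JSON: CSV values arrive as strings, so convert numbers before exporting.
records = [{"agency": r["agency"], "requests": int(r["requests"])} for r in rows]
json_text = json.dumps(records)

# 3. SQL: load the records into an in-memory SQLite database and query them.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE requests_by_agency (agency TEXT, requests INTEGER)")
conn.executemany(
    "INSERT INTO requests_by_agency VALUES (:agency, :requests)", records
)
total = conn.execute("SELECT SUM(requests) FROM requests_by_agency").fetchone()[0]
conn.close()

print(json_text)
print(total)
```

The conversion step in the middle is the part students most often miss: a CSV carries no type information, so every value is text until the code says otherwise.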

Tech Team Report

Another useful resource for understanding the tools of data journalism was prepared at Stanford by an interdisciplinary team of computer science and data journalism students in a Spring 2015 course on watchdog reporting. The report is available here: http://cjlab.stanford.edu/tech-team-report/

Resources

Online courses and MOOCs

Doing Journalism with Data: First Steps, Skills and Tools (http://datajournalismcourse.net/)

School of Data (http://schoolofdata.org/)

The Knight Center for Journalism in the Americas offers a number of MOOCs as distance learning for journalists (https://knightcenter.utexas.edu/distancelearning)

Useful Data Sets for Classwork and Assignments

Baby name census data: clean data, always varies from year to year, papers always cover it (Top 1000 baby names by year can be found at https://www.ssa.gov/oact/babynames/limits.html)

Greenhouse gas data (NOAA has a number of searchable data sets at http://www.esrl.noaa.gov/gmd/dv/data/)

Student grade distributions for your college

This is a small dataset used in a lot of the School of Data Examples: The GRAIN database of land grabs (http://datahub.io/dataset/grain-landgrab-data/resource/af57b7b2-f4e7-4942-88d3-83912865d116)

World Bank Open Data (http://data.worldbank.org/)

The Guardian Databases (http://www.theguardian.com/news/datablog/interactive/2013/jan/14/all-our-datasets-index)

The Eurostat Databases (http://ec.europa.eu/eurostat/help/new-eurostat-website)

UK Government Databases (https://data.gov.uk/data/search?res_format=RSS)

National and International Statistical Services by region and country (https://en.wikipedia.org/wiki/List_of_national_and_international_statistical_services)

Global Health Observatory Data Repository (http://apps.who.int/gho/data/node.home)

Business Registry Databases (https://www.investigativedashboard.org/business_registries/)

Google’s list of Public Data (http://www.google.com/publicdata/directory#)

OpenSpending (https://openspending.org/)

Datahub (http://datahub.io/)

Open Access Directory (http://oad.simmons.edu/oadwiki/Main_Page)

Data Portals (http://dataportals.org/)

NASA’s Data Portal (https://data.nasa.gov/)

Philip Meyer’s recommended texts

John Tukey, Exploratory Data Analysis (Upper Saddle River, NJ: Pearson Education, 1977)

James A. Davis, The Logic of Causal Order (Thousand Oaks, CA: Sage, 1985)

Robert P. Abelson, Statistics as Principled Argument (Hillsdale, NJ: Lawrence Erlbaum Associates, 1995)

Data Journalism Articles, Projects, and Reading Lists Used in Instruction

MOOC Examples:

Cairo, Alberto. “Recommended Resources for My Infographics and Visualization Courses.” Personal. The Functional Art: An Introduction to Information Graphics and Visualization, October 11, 2012. http://www.thefunctionalart.com/2012/10/recommended-readings-for-infographics.html.

“Cameroon - Cameroon Budget Inquirer.” Accessed September 23, 2015. http://cameroon.openspending.org/en/.

Downs, Kat, Dan Hill, Ted Mellnik, Andrew Metcalf, Cory O’Brien, Cheryl Thompson, and Serdar Tumgoren. “Homicides in the District of Columbia - The Washington Post.” News. The Washington Post, October 14, 2012. http://apps.washingtonpost.com/investigative/homicides/.

“FindMySchool.Ke.” Accessed September 23, 2015. http://findmyschool.co.ke/.

Keefe, John, Steven Melendez, and Louise Ma. “Flooding and Flood Zones | WNYC.” News. WNYC. Accessed September 23, 2015. http://project.wnyc.org/flooding-sandy-new/index.html.

Kirk, Chris, and Dan Kois. “How Many People Have Been Killed by Guns Since Newtown?” Slate, September 16, 2013. http://www.slate.com/articles/news_and_politics/crime/2012/12/gun_death_tally_every_american_gun_death_since_newtown_sandy_hook_shooting.html.

Lewis, Jason. “Revealed: The £1 Billion High Cost Lending Industry | The Bureau of Investigative Journalism.” Journalism. The Bureau of Investigative Journalism, June 13, 2013. https://www.thebureauinvestigates.com/2013/06/13/revealed-the-1billion-high-cost-lending-industry/.

Nguyen, Dan. “Who in Congress Supports SOPA and PIPA/PROTECT-IP? | SOPA Opera.” News. ProPublica, January 20, 2012. http://projects.propublica.org/sopa/.

Rogers, Simon. “Government Spending by Department, 2011-12: Get the Data.” The Guardian, December 4, 2012, sec. UK news. http://www.theguardian.com/news/datablog/2012/dec/04/government-spending-department-2011-12.

Rogers, Simon. “John Snow’s Data Journalism: The Cholera Map That Changed the World.” The Guardian, March 15, 2013, sec. News. http://www.theguardian.com/news/datablog/2013/mar/15/john-snow-cholera-map.

Rogers, Simon. “Wikileaks Data Journalism: How We Handled the Data.” The Guardian, January 31, 2011, sec. News. http://www.theguardian.com/news/datablog/2011/jan/31/wikileaks-data-journalism.

Rogers, Simon. “Wikileaks Iraq War Logs: Every Death Mapped.” The Guardian, October 22, 2010. http://www.theguardian.com/world/datablog/interactive/2010/oct/23/wikileaks-iraq-deaths-map.

Rogers, Simon, and John Burn-Murdoch. “Superstorm Sandy: Every Verified Event Mapped and Detailed.” The Guardian, October 30, 2012. http://www.theguardian.com/news/datablog/interactive/2012/oct/30/superstorm-sandy-incidents-mapped.

Serra, Laura, Maia Jastreblansky, Ivan Ruiz, Ricardo Brom, and Mariana Trigo Viera. “Argentina’s Senate Expenses 2004-2013.” News. La Nacion, April 3, 2013. http://blogs.lanacion.com.ar/ddj/data-driven-investigative-journalism/argentina-senate-expenses/.

Shaw, Al, Jeremy B. Merrill, and Amanda Zamora. “Free the Files: Help ProPublica Unlock Political Ad Spending.” ProPublica, September 4, 2015. https://projects.propublica.org/free-the-files/.

“Where Does My Money Go?” Accessed September 23, 2015. http://wheredoesmymoneygo.org/.

Lede Program Curriculum

The Lede Program at Columbia Journalism School is a post-baccalaureate program in which students from a variety of backgrounds learn data and computation skills over the course of one or two semesters. The program was designed to help students rapidly elevate their skills in these areas, especially if they were considering applying for Columbia’s highly demanding dual-degree program in journalism and computer science.

In the context of this report, the one-semester version of the Lede represents a promising “extended boot camp” in which students who have been accepted into a data journalism master’s program may attend for a full summer before their peers in order to develop the skills that will help them get the most out of their education.

The following course descriptions were pulled on November 5, 2015, from: http://www.journalism.columbia.edu/page/1060-the-lede-program-courses/908

Foundations of Computing

During this introduction to the ins and outs of the Python programming language, students build a foundation upon which their later, more coding-intensive classes will depend. Dirty, real-world data sets will be cleaned, parsed and processed while recreating modern journalistic projects. The course will also touch upon basic visualization and mapping, and how to use public resources such as Google and Stack Overflow to build self-reliance.

Focus: Familiarize yourself with the data-driven landscape

Topics & tools include: Python, basic statistical analysis, OpenRefine, CartoDB, pandas, HTML, CSVs, algorithmic story generation, narrative workflow, csvkit, git/GitHub, Stack Overflow, data cleaning, command line tools, and more

Data and Databases

Students will become familiar with a variety of data formats and methods for storing, accessing and processing information. Topics covered include comma-separated documents, interaction with website APIs and JSON, raw-text document dumps, regular expressions, text mining, SQL databases, and more. Students will also tackle less accessible data by building web scrapers and converting difficult-to-use PDFs into useable information.

Focus: Finding and working with data

Topics & tools include: SQL, APIs, CSVs, regular expressions, text mining, PDF processing, pandas, Python, HTML, BeautifulSoup, IPython Notebooks, and more
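As one illustration of how regular expressions turn a raw-text document dump into structured data, the sketch below extracts ISO-style dates and dollar amounts from a short passage. The passage, the two patterns, and the variable names are invented for this example and are not drawn from the Lede course materials.

```python
import re

# Invented text standing in for a raw document dump.
TEXT = (
    "On 2015-03-02 the board approved $1,250,000 for road repairs. "
    "A second payment of $75,500 followed on 2015-04-17."
)

# Dates written as YYYY-MM-DD.
DATE_RE = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")

# Dollar amounts with optional thousands separators, e.g. $1,250,000.
MONEY_RE = re.compile(r"\$(\d{1,3}(?:,\d{3})*)")

dates = DATE_RE.findall(TEXT)
amounts = [int(m.group(1).replace(",", "")) for m in MONEY_RE.finditer(TEXT)]

print(dates)
print(amounts, sum(amounts))
```

Patterns like these are brittle by nature: a document that writes dates as "March 2, 2015" or amounts as "$1.25 million" would slip through, so the extracted values should always be spot-checked against the source text.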

Algorithms

Machine learning and data science are integral to processing and understanding large data sets. Whether you’re clustering schools or crime data, analyzing relationships between people or businesses, or searching for a single fact in a large data set, algorithms can help. Through supervised and unsupervised learning, students will generate leads, create insights, and figure out how to best focus their efforts with large data sets. A critical eye toward applications of algorithms will also be developed, uncovering the pitfalls and biases to look for in your own and others’ work.

Focus: Analyzing your data

Topics & tools include: linear regression, clustering, text mining, natural language processing, decision trees, machine learning, scikit-learn, Python, and more
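Linear regression, the first topic in that list, can be sketched in a few lines of plain Python. The course names scikit-learn as its tool; this version computes the ordinary least squares line by hand so that it runs without any installed packages, and the data points and function name are invented for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance of x and y, and variance of x (both unnormalized).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Invented points that lie exactly on the line y = 2x + 1.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]

slope, intercept = fit_line(xs, ys)
residuals = [y - (slope * x + intercept) for x, y in zip(xs, ys)]

print(slope, intercept)
```

With real data the residuals will not be zero, and inspecting them is a first step toward the "critical eye" the course description calls for: large or patterned residuals suggest that a straight line is the wrong model for the story.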

Data Analysis Studio

In this project-driven course, students refine their creative workflow on personal work, from obtaining and cleaning data to final presentation. Data is explored not only as the basis for visualization, but also as a lead-generating foundation, requiring further investigative or research-oriented work. Regular critiques from instructors and visiting professionals are a critical piece of the course.

Focus: Applying your skill set

Topics & tools include: Tableau, web scraping, mapping, CartoDB, GIS/QGIS, data cleaning, documentation, and more

Acknowledgments

We could not have done this without the assistance of Maxwell Foxman and Joscelyn Jurich, two Ph.D. students at Columbia. Max and Joscelyn traveled far and wide, scoured the Web, and checked innumerable facts in order for this report to even come close to depicting the state of data journalism education. Their thoughtful memos, perceptive comments, and yes, meticulous spreadsheets were vital contributions to this research.

Many of the insights in this report should be credited to our advisory committee: Sarah Cohen, Meredith Broussard, Steve Doig, Michelle Minkoff, Shazna Nessa, Jeremy Singer-Vine, Jonathan Stray, Matt Waite, and Derek Willis. Our committee met twice, first to launch the project and frame its mission, then seven months later to review our findings and help us to refine our conclusions.

A number of journalists and journalism teachers also offered in-depth interviews on their experience, the state of the field, and their vision for the future, and for that we thank them for sharing their time and their insights: Jonathon Berlin, Rahul Bhargava, David Boardman, R.B. Brenner, Matt Carroll, Ira Chinoy, Brian Creech, Catherine D’Ignazio, Nick Diakopoulos, David Donald, Jaimi Dowdell, Deen Freelon, David Herzog, Mark Horvit, Brant Houston, Mike Jenner, Dan Keating, Jennifer LaFleur, Darnell Little, Kathy Matheson, Tom McGinty, Philip Meyer, George Miller, T. Christian Miller, Maggie Mulvihill, Jonah Newman, Ben Poston, Kevin Quealy, Mike Reilley, Simon Rogers, Cindy Royal, Judd Slivka, Margot Susca, Ben Welsh, Aaron Williams, and Zach Wise.

Special thanks go out to those who invited us into their classrooms to watch how they teach. Catherine D’Ignazio of Emerson College offered our very first observation and also introduced us to her colleagues at the MIT Media Lab who are tackling similar questions about teaching data. Zach Wise and Larry Birnbaum invited us to observe their team-taught class at Northwestern. Amanda Hickman hosted us twice for her class at the CUNY Graduate School of Journalism. Meredith Broussard and George Miller each allowed us to observe their classes at Temple University. Likewise, Dan Keating graciously hosted an in-class observation at the University of Maryland.

We would also like to thank the students who spoke to us about their experience learning to use data in their reporting: Matt Bernardini, Simeng Dai, George Dumontier, Alex Duner, Jasmine Han, John Hilliard, Austin Huguelet, Ashley Jones, Anne Li, Mary Ryan, and Nicole Zhu. We wish them fruitful careers as they shape this field of practice.

Through the course of this bicoastal research project, the Columbia and Stanford communities have provided an unimaginable level of support and inspiration. Mark Hansen, the East Coast director of the Brown Institute for Media Innovation, has articulated a remarkable vision for the union of journalism and computation, as well as a model for bicoastal collaboration. Susan McGregor shared her deep understanding of journalism, pedagogy, and digital security at crucial stages of this project, in addition to hosting us in her classroom twice. Giannina Segnini, James Madison Visiting Professor on First Amendment Issue at the Columbia School of Journalism, offered her considerable expertise in the use of data for global investigative reporting. Those from Stanford who provided critical input and support include Hearst Professional in Residence Dan Nguyen, Peninsula Press Managing Editor Vignesh Ramachandran and Geri Migielicz, the Lorry I. Lokey Visiting Professor in Professional Journalism, who is co-teaching a virtual reality class in the winter of 2015.

Steve Coll expressed an elegant and convincing case for data journalism’s place in contemporary practice, from which we borrowed unabashedly and at length. This report was substantially improved by Jay Hamilton, Sarah Cohen, Jonathan Stray, Jonathan Soma, and Chris Anderson, who read our first draft and offered remarkably useful feedback. We thank Marcia Kramer for the thought and care with which she edited this report.

We are indebted to the John S. and James L. Knight Foundation for funding this research. Knight’s commitment to both the future of journalism education and to innovation in journalistic practice dovetailed in this project, and we hope that it is a worthy contribution to Knight’s greater mission.

Finally, we thank the person who conceived of this project, recruited our team, and served as a sagely advisor at every step: Sheila Coronel, director of Columbia’s Stabile Center for Investigative Journalism. Sheila recognized not only the urgency of developing a data curriculum based on an empirical study of the field, but also the value of sharing the results with other journalism educators. What could easily have been just a research study, or else just a curriculum development project, was envisioned by Sheila as something that could be greater than the sum of its parts.
