![Page 1: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/1.jpg)
Data Science master track
![Page 2: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/2.jpg)
Scientific questions you will study
• What is clustering?
• What is causality?
• How can one efficiently search and rank?
• Can we build a reliable model from complex data?
![Page 3: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/3.jpg)
Why are these questions important?
To help and improve our society
![Page 4: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/4.jpg)
iCIS data science groups
Prof. Heskes (master track coordinator)machine learning theory and applications
Prof. Lucas Bayesian networks and eHealth
Prof. Kraaijinformation retrieval and multi-media data analysis
Prof. Van der Weide information systems and retrieval
![Page 5: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/5.jpg)
iCIS data science groups
Dr. Marchiori complex networks and machine learning
Prof. Hildebrandtprivacy and legal aspects of data science
Prof. Karssemeijercomputer-aided diagnosis and medical imaging
![Page 6: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/6.jpg)
Mandatory and optional coursesMandatoryMachine Learning in Practice (6 ec) HeskesInformation Retrieval (6 ec) Van der Weide, KraaijBayesian and Decision Models in AI (6ec) Lucas, Velikova OptionalData Science Theory and ToolsStatistical Machine Learning (6 ec) Wiegerinck, Claassen, Kappen, HeskesBio-inspired Algorithms (3 ec*) MarchioriEvolutionary Algorithms (6 ec) Sprinkhuizen-Kuijper, MarchioriMachine Learning (6 ec) Kappen, Wiegerinck Data Science ApplicationsComputer Aided Diagnosis in Medical Imaging (6 ec) KarssemeijerBayesian Neurocognitive Modeling (6ec) Van GervenBioinformatics (3ec) MarchioriPattern Recognition for Natural Sciences (3ec) Buydens and othersIntelligent Information Tools (5 ec**) Van den Bosch Data Science AspectsLaw in Cyberspace (6 ec) HildebrandtFoundations of Information Systems (6 ec) Van BommelCognition and Representation (6 ec) SarboBusiness Rules Specification and Application (3 ec) Hoppenbrouwers
![Page 7: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/7.jpg)
Example: Machine Learning in Practice
• Basic idea: student teams enter an ongoing machine learning competition
• While trying to beat the other teams, students learn the ins and outs of challenging machine learning problems
• Example: learn to detect whale calls in order to prevent collisions
• The Radboud team called “Sushi” iscurrently in the top quarter of more than200 contenders
spectogram with a typical whale call
![Page 8: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/8.jpg)
Example: Bio-inspired algorithms
• Basic idea: student teams investigate diverse types of bio-inspired methods
• The teams choose a problem and solve it using bio-inspired methods
Example: use immune systems mechanisms to develop a method for image similarity search.
Similarity Search using a Negative Selection Algorithm
was accepted at ECAL 12Advances in Artificial Life, 2013
target image
top four retrieved images
![Page 9: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/9.jpg)
Research projects (afdelingsstages)
Join one of the 7 research groups within the institute
![Page 10: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/10.jpg)
Can Google Trends predict outbreaks of influenza?
[1] showed that outbreaks of influenza were correlated to numbers of Google searches for terms related to flu prevention and cure. This and other studies indicate correlations between search volume and (social) behavior, but do this "after the fact" and hence may be susceptible to overfitting.
Do the suggested correlations also apply to novel data? Are other examples of predictive power of Google Trends?
[1] http://www.nature.com/nature/journal/v457/n7232/full/nature07634.html
![Page 11: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/11.jpg)
Predicting protein three-dimensional structures
Protein residue-residue contact prediction can be useful in predicting protein three-dimensional structures.
PSICOV [1] is a state of the art method to predict contacts between residues of a target protein using information from the protein sequences of its family.
Can we exploit biological knowledge toimprove PSICOV?
[1] http://bioinformatics.oxfordjournals.org/content/28/2/184.full
![Page 12: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/12.jpg)
Examples of master thesis projects
Steffen Janssen developed a tool to predict productivity of software projects based on neural networks for the Dutch tax authorities
Kristel Rösken applied data mining to social network profiles for Logica BV
Thomas Janssen improved the fitting of hearing aids by machine learning for the hearing aid company GN ReSound
![Page 13: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/13.jpg)
Examples of master thesis projects
Louis Onrust studied a novel machine learning method for the extraction of brain structure from neuroimaging data
Niels Radstake investigated Bayesian approaches to analyze mammographic images
Jelle Schühmacher came up with a classifier-based method for searching large document collections
![Page 14: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/14.jpg)
Job perspectives A larger company as consultant or data analysis specialist or start up your own company in data analytics, or go for aPhD
Quantitative risk analyst at ABN AMRO Bank (Rasa Jurgelenaite)Senior Scientist at Philips Research (Bart Bakker)Metrology Software Design Engineer at ASM (Pavol Jancura)Business Analist E-business at VVV Nederland BV(Kristel Rösken)OBI4wan(Alex Slatman)
PhD students(Max Hinne, Wout Megchelenbrink)
![Page 15: Data Science master track. Scientific questions you will study What is clustering? What is causality? How can one efficiently search and rank? Can we](https://reader035.vdocuments.net/reader035/viewer/2022062515/56649c525503460f948fba3b/html5/thumbnails/15.jpg)
Unique aspects of our data science track
Diversity: multiple aspects and applications of data science
Excellence: students are embedded in research groups
Flexibility: large choice of courses to shape student interests