r&d search 081013 search solutions conference

19
Rapid Delivery Of Business Intelligence Applications Through R&D Search Experience Search Solutions 2013 Tuesday October 8 th Nick Brown, Susan Donohoe, Rob Hernandez, Youssef Belghali, Nasko Radev, Steve Woodward & Akshay Tankhiwale

Upload: nick-brown

Post on 19-Jan-2015

2.071 views

Category:

Technology


0 download

DESCRIPTION

Presentation given by Nick Brown (AstraZeneca) on 8th October 2013 at the Search Solutions 2013 conference.

TRANSCRIPT

Page 1: R&D Search 081013 Search Solutions Conference

Rapid Delivery Of Business Intelligence Applications Through R&D Search Experience

Search Solutions 2013

Tuesday October 8th

Nick Brown, Susan Donohoe, Rob Hernandez, Youssef Belghali, Nasko Radev, Steve Woodward & Akshay Tankhiwale

Page 2: R&D Search 081013 Search Solutions Conference

AstraZeneca Health Connect Us All

AstraZeneca is a biopharmaceutical company with Research and Development at

its core. Our business is providing innovative, effective medicines that make a real

difference to patients. We focus on six important areas of healthcare.

In R&D, we invest over $4 billion every year and with over 15,000 professionals

in 8 countries, on 3 continents, accessing and leveraging information is key.

Page 3: R&D Search 081013 Search Solutions Conference

Distributed R&D Leads to Information Silos

Photo Credit: http://cdn-wac.emirates247.com/polopoly_fs/1.509718.1370831315!/image/256556252.jpg

Page 4: R&D Search 081013 Search Solutions Conference

4. Insight & Analytics

Publications Trials

Conferences News

Patents

Grants

RDF data

Oracle Data Marts

CRM SharePoint PKT

Yammer File shares

LDMS

Wiki

Unstructured External structured Internal Unstructured Internal

1. ETL

2. Ontology

Enrich

Auto-Tagging

Auto-Class

Text Mining

Entity Extraction

3. Search

Index

NLP

Rules Match

Normalization

Cluster

5. Business

Applications

Existing Semantic Search Architecture

Page 5: R&D Search 081013 Search Solutions Conference

Rapidly deliver mobile business intelligence applications 1 5

Connectors to any unstructured and structured sources 1 1

Accurate semantic mark-up with text-mining capabilities 1 2

Intelligent, intuitive search that hides the advanced features 1 3

Generate insight & analytics across information types 1 4

Strategic Approach Technology Stack

3 months ago, we licensed Sinequa for our R&D search platform.

Page 6: R&D Search 081013 Search Solutions Conference

Advanced Widgets Built To Be Put Together Easily In Different Ways

Photo Credit: http://media2.ph.88db.com/DB88UploadFiles_med2/2010/05/08/1909CD39-2608-4333-B7D5-16F79E0FA1D4.JPG

Page 7: R&D Search 081013 Search Solutions Conference

Virtual Team Connected By Passion

To build our applications rapidly, we supplement our team with external experts,

including running competitions on open innovation platform like TopCoder.

Page 8: R&D Search 081013 Search Solutions Conference

External Data Sources Easily Connected

In R&D, we have over 200 million documents in publications, patents and

conference abstracts. Having a historical perspective can help when designing

business intelligence applications like breaking science or target selection

Publications Patents

Conferences

Clinical Trial Registries Grants

25M

80M 60M

Page 9: R&D Search 081013 Search Solutions Conference

R&D Wiki

Internal Data Sources Security & Access Control

Department

Fileshares

The richest, most valuable content is our internal data sources. Our systems

adheres to our security controls – you only find what you have access to…

Page 10: R&D Search 081013 Search Solutions Conference

R&D Search Screenshot

We automatically search other synonyms like

Vandetanib and internal identifiers such as ZD6474

Top hits are now key relevant scientific documents

R&D vocabs are dispayed, from brand, disease,

scientists & mechanisms such as EGFR and VEGFR

R&D Search can handle a number of languages

Page 11: R&D Search 081013 Search Solutions Conference

Developed new approaches within Sinequa to allow easy vocabulary curation. --

Tagging scores allow us to identify documents with no tags or too many tags

Hiearchical synonym trees help to rapidly identify problem terms like ‘when’

Individual documents display number of synonym occurrences.

R&D Vocabularies Screenshot

Focused on vocabularies that are important to scientists :-

Drugs Diseases Genes MicroRNA

People Companies Organisms Cell-lines

Cell types Technology Skills Safety Mechanisms

Page 12: R&D Search 081013 Search Solutions Conference

R&D Department Screenshot

Teams can search across this rich internal content and find not just relevant

documents but also other drugs, mechanisms and even people to help.

Page 13: R&D Search 081013 Search Solutions Conference

R&D Journal Screenshot

Developed to look like an external scientific journal, R&D Journal provides a mechanism within AstraZeneca where our scientists can publish articles and experimental reports that can be shared and pushed out to other members of the department Other users can add ratings and comments, as well as sign up for alerts and search across this content

Page 14: R&D Search 081013 Search Solutions Conference

R&D Labs Mobile Access To Apps

Currently piloting Amazon web-services with Ping Federate (authentication) and Data Power (access), to enabled mobile applications to query against our search index: drug repositioning life cycle management external KOL identification conference capture breaking science chemical search

Page 15: R&D Search 081013 Search Solutions Conference

R&D Experts Find & Connect within AZ & MedImmune

Experts allows R&D to find and connect to the

key experts on any scientific topic.

Minimise duplication

Increase cross R&D collaboration

Automatically updated

Recommend new contacts

Curate & advertise yourself

Social network analysis & visual

connectivity

Page 16: R&D Search 081013 Search Solutions Conference

Next Steps More R&D Indexing

Photo credit: http://chamorrobible.org/images/photos/gpw-200904-NASA-ISS016-E-37922-The-World-Dubai-United-Arab-Emirates-20080403-large.jpg

Page 17: R&D Search 081013 Search Solutions Conference

Next Steps More Business Applications

Deliver applications that use analytics across the entire document index such as

drug repositioning and external KOL identification, made mobile.

Page 18: R&D Search 081013 Search Solutions Conference

Next Steps More Search Widgets

Further collaborate with Sinequa to implement other features around

visualisation, feedback & commenting and new search relevancy algorithms

Page 19: R&D Search 081013 Search Solutions Conference

Thank You Acknowledgements & Questions

Delivering this in the past 12 weeks wouldn’t have been possible without an

enormous amount of support from many people, not all listed here today.

Sinequa: Christian Sestier, Tim Bell, Xavier Pornain, Ariane Cavet, Frédéric

Lardé, Olivier Gaunet & Alex Bilger

Pebble Code: John Mildinhall, Tak Tran, Mark Durrant & Toby Hunt

AstraZeneca: Youssef Belghali, Tim McCoy, David Rafferty, Nick Barlow, Tania

Hide, Lisa Taylor, Hari Radhakrishnan, Adel Kassim & Pete Dudek.

Finally many thanks to Sebastian Lefebvre, Jason Swift & Paul Fitzpatrick for

sponsoring and helping us to get this project launched.