2e components

4
I2E Pipeline Pilot Components Integrating Text Mining into Your Workflow Linguamatics I2E Pipeline Pilot Components help you to effectively integrate the powerful text mining capabilities of I2E into your scientific workflow. Each organization has its own way of working in order to fulfill its business needs. A workflow is built up to reflect these business processes and tools are put in place to support them. I2E Pipeline Pilot Components enable you to include agile NLP-based querying in a scientific workflow developed using Accelrys Pipeline Pilot. Using I2E Pipeline Pilot Components within your workflow makes text mining available to a wider range of users, enabling them to easily perform complex text mining operations on large numbers of documents, extracting cricital knowledge to support business decisions. I2E Pipeline Pilot Component Benefits: Enables your workflow to support faster and better business decisions using agile high performance text mining Easy integration into your overall workflow Get up and running quickly. 'Out of the box' examples provided Easily customize to your own needs Allows I2E to be integrated with your other business applications Access Content Varied Sources Access Content Varied Sources Preprocess Content Augment Merge Convert to XML Preprocess Content Augment Merge Convert to XML Index Content Index Content Query Query Development and refinement Query Query Development and renement Results Curate View Feedback into Query Dev Results Curate View Feedback into Query Dev Post Process Results Convert Graphical Viewers Post Process Results Convert Graphical Viewers Publish Analysis Portals Email Publish Analysis Portals Email Websites Websites Scientific Literature Scientic Literature Internal Documents Internal Documents News Feeds News Feeds ClinicalTrials.gov ClinicalTrials.gov Patents Patents Figure 1 Integrate agile NLP-based querying in a scientific workflow 1

Upload: others

Post on 11-Feb-2022

13 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 2E Components

I2E Pipeline PilotComponents

3© 2012 Linguamatics Ltd. The Linguamatics logo is a trademark of Linguamatics Ltd. All rights reserved. All other trademarks mentioned in this document are the property of their respective owners.

Integrating Text Mining into Your Workflow

Linguamatics I2E Pipeline Pilot Components help you to effectively integrate the powerful text mining capabilities of I2E into your scientific workflow.

Each organization has its own way of working in order to fulfill its business needs. A workflow is built up to reflect these business processes and tools are put in place to support them. I2E Pipeline Pilot Components enable you to include agile NLP-based querying in a scientific workflow developed using Accelrys Pipeline Pilot.

Using I2E Pipeline Pilot Components within your workflow makes text mining available to a wider range of users, enabling them to easily perform complex text mining operations on large numbers of documents, extracting cricital knowledge to support business decisions.

I2E Pipeline Pilot Component Benefits:

Enables your workflow to support faster and better business decisions using agile high performance text mining

Easy integration into your overall workflow

Get up and running quickly. 'Out of the box' examples provided

Easily customize to your own needs

Allows I2E to be integrated with your other business applications

AccessContent

VariedSources

AccessContent

VariedSources

PreprocessContent

AugmentMergeConvert to XML

PreprocessContent

AugmentMergeConvert to XML

IndexContentIndexContent

Query

QueryDevelopment andrefi nement

Query

QueryDevelopment andrefi nement

Results

CurateViewFeedback into Query Dev

Results

CurateViewFeedback into Query Dev

Post Process Results

ConvertGraphical Viewers

Post Process Results

ConvertGraphical Viewers

PublishAnalysis

PortalsEmail

PublishAnalysis

PortalsEmail

WebsitesWebsites

Scientifi c LiteratureScientifi c Literature

Internal DocumentsInternal Documents

News FeedsNews Feeds

ClinicalTrials.govClinicalTrials.gov

PatentsPatents

Figure 1

Integrate agile NLP-based querying

in a scientific workflow

1

Page 2: 2E Components

4

I2E Pipeline Pilot components can be used to integrate text mining into larger data processing and analysis workflows. The components enable your workflow to support faster and better business decisions using agile, high performance text mining. The set of components provided allow modular access to I2E features.

Building Indexes in Pipeline Pilot

To prepare source data for querying by I2E, use the I2E Index Maker component:

Works from many common data types or data can be pre-processed with Pipeline Pilot data conversion components

Choose from 6 preset indexing configurations or add your own

I2E Index Maker submits a command to I2E, which generates the index on the I2E Server

Components output file name and location, which can be passed to other components for further processing.

Figure 2

The components

provide many different

touch-points with I2E

functionality.

Figure 3

Easily make indexes from

selected document sets.

2

Page 3: 2E Components

5

Running I2E Queries in Pipeline Pilot

To include an I2E query in a Pipeline Pilot protocol, use the Query Submitter component:

Build queries to run on prepared document sets (indexes)

Component generates and submits a batch command to I2E. (You need a licence for I2EBatch Querying.)

Output results in a range of formats for further data processing and analysis, such as TSV, CSV and XLS

View results using Pipeline Pilot component readers and viewers.

RequirementsIn order to use I2E Pipeline Pilot Components you must have:

I2E Server for Linux installed and running

Pipeline Pilot installed and running

The I2E Query Submitter Component requires an I2E Batch Query license

I2E Pipeline Pilot Components help you to effectively integrate the powerful text mining capabilities of I2E into your scientific workflow

For more information or a demonstration, please call us today

+44 1223 651910 [email protected]

Figure 4

Embed queries into your workflow to

extract knowledge.

3

Page 4: 2E Components

2

Linguamatics 324 Cambridge Science Park

Milton Road Cambridge CB4 0WG UK

Tel: +44 (0)1223 651 910

Linguamatics 1900 West Park Drive Suite 280

Westborough MA 01581 USA

Tel: +1 617 674 3256

© 2012 Linguamatics Ltd. The Linguamatics logo is a trademark of Linguamatics Ltd. All rights reserved. All other trademarks mentioned in this document are the property of their respective owners.

www.linguamatics.com

How You Can Benefit

Linguamatics I2E text mining platform can dramatically improve speed to knowledge discovery, resulting in time and cost savings for your organization. Use of Linguamatics’ text mining platform enables companies to discover more and make better decisions faster, with the very tangible benefits of higher return on investment, greater accuracy of results and improved effectiveness of analytical and research efforts.

Customers have reported that our I2E software has provided at least a 10x speed up in getting to actionable results compared to

using conventional search approaches.

Benefits are realized across entire companies, as well as at the level of an individual executive, analyst or informatician. The I2E platform enables the automation of previously manual, repetitive tasks, making users much more productive and freeing them to focus on querying strategies and interpretation of results. In many cases, I2E has made projects successful that would otherwise be impossible or too time prohibitive.

About Text Mining

Text mining interprets the meaning of text and extracts high value relationships. It goes way beyond traditional search.

A large proportion of scientific knowledge is only available within text. This may be scientific papers, internal reports, patents, news feeds or text fields within semi-structured data such as medical records or electronic lab notebooks. Given the unprecedented and continuing increase in available textual information, new methods are required to extract relevant, decision-critical information.

Advanced text mining technology is now established as an automated solution for extracting and connecting knowledge from textual information sources at large scale. The technology has been making a significant impact in life sciences, for example safety, clinical trials, systems biology, target selection, patent analysis and competitive intelligence.

Text mining methods have the ability to analyze unstructured or semi structured text and derive concepts, structure and relationships from it. It is an alternative approach to traditional search engines using keywords as a way of retrieving documents to find information.

About I2E

Linguamatics I2E is an agile, scalable, high performance text mining system that aids organizations in discovering and synthesizing knowledge from unstructured text in large document collections. It has a proven track record in delivering best-of-breed text mining capabilities covering a broad range of application areas. Its agile nature allows tuning of query strategies to deliver the precision and recall you need for your specific task, at enterprise scale.

Linguamatics offers a variety of ways to access I2E:

• Deploying in-house with I2E Enterprise

• Accessing I2E via the cloud

• I2E OnDemand

• I2E Managed Services

I2E can also be integrated with other tools and built into workflows, for example using the I2E Pipeline Pilot Components.

About Linguamatics

Linguamatics enables organizations to maximize the valuederived from information resources through effectivedeployment of innovative natural language processing(NLP)-based technology. Linguamatics has a rapidly growing user community with I2E deployed at 9 of the top-10 pharma companies. The company was founded in 2001, and is based in Cambridge, UK and offices near Boston, USA

Contact Us

For further information, please email [email protected], or call or write to us at the addresses provided below.

4