Harnessing Unstructured Data for Competitive Business Intelligence
Information Technology Services Informatics
[Diagram labels: Surveys, Documents, Web, Regulatory, Records, Reports, Maps, Analytics, Actionable Insight]
What are the sources of unstructured data for the organisation?
Presenter
Presentation Notes
Index document fracturing; Named value processing; Business Intelligence for text; Category/taxonomy parameters
[Diagram labels: Unstructured, Semi-Structured, Structured; Surveys, Analytics, Web, Regulatory, Doc Mngt, Records, Reports, Maps]
How to group data in terms of structure?
Presenter
Presentation Notes
The most unstructured of all are handwritten notes; these can now be read and digitised using different techniques. Records, digital asset management, web and documents are generally termed enterprise content management. From an analyst's point of view, this data is in most cases not structured for simple drill-down query analysis linking multiple data types. The exception tends to be coded surveys. This is changing with Data Warehouse 2.0, where structured and unstructured datasets are slowly coming together. The best examples of this are products like Alfresco or the ETL tools that extract unstructured datasets and commentary.
The amount of content and relationships keeps growing
The ways in which organisational data is constantly changing and evolving
Presenter
Presentation Notes
Digital Life

One noticeable impact of the digital tsunami rolling through our lives is that attention spans are dropping year on year. Studies show that information workers and executives now switch tasks on average every three minutes throughout the day. Overall, attention spans have halved to just five minutes over the last decade, so people are beginning to adapt to these highly interrupted ways of working.

The challenge for information management appears to be to deliver and search even more quickly for relevant and valuable content. Executives are now required to process detailed analysis and make decisions within minutes. One way to match this attention span is to make information more linkable, visual, shareable and intuitive. Some people are more skilled than others in their natural ability to take a quick look at data and see important distinctions. This ability to 'see with our brains' has been written about in the effective communication literature (Reference 2).

There is also another apparent change in behaviour: consumption habits are changing with new devices and technologies. So making information more accessible, transparent and shareable is a key objective for analytics and memory at Macquarie. Research from 2010 reported: "2 out of 3 consumers watch four or more programs on four or more channels per week." "One third of adults watch some content via alternative devices."

Reference 2: Information Dashboard Design: The Effective Visual Communication of Data, Stephen Few, O'Reilly.
https://www.leximancer.com/ (best demo I've seen so far for a text mining tool; the company is based in Queensland)
http://www.clarabridge.com/ (nice demo, commercial software)
http://gate.ac.uk/overview.html (open source)
http://uima.apache.org/ (open source)
http://www.megaputer.com/polyanalyst.php (nice diagram, commercial software)
http://www.lexalytics.com/ (demo app downloadable)
Dealing with change in unstructured and semi-structured data?
Presenter
Presentation Notes
Is it possible to manage these things with an agreement that ensures information is not vulnerable to disclosure, loss or compromise? Cloud computing poses new challenges. Classify your important business records information into a classification scheme that relates closely to the organisation's taxonomy.

How: Index document fracturing; Named value processing; Business Intelligence for text; Category/taxonomy parameters
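The techniques named in these notes are not defined on the slide, so the following is only a rough sketch under stated assumptions: "document fracturing" is taken to mean breaking text into individual words and dropping stop words, "named value processing" to mean pulling out "name: value" pairs, and "category/taxonomy parameters" to mean scoring the words against a list of terms per category. The stop words, the taxonomy and all function names are hypothetical and exist only for illustration.

```python
import re
from collections import Counter

# Hypothetical stop words and taxonomy, invented for illustration only.
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "is", "in", "for", "on"}
TAXONOMY = {
    "finance": {"invoice", "budget", "payment"},
    "hr": {"leave", "recruitment", "payroll"},
}


def fracture(text):
    """Assumed 'document fracturing': lowercase words with stop words removed."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return [w for w in words if w not in STOP_WORDS]


def named_values(text):
    """Assumed 'named value processing': collect 'name: value' pairs, one per line."""
    pairs = re.findall(
        r"^\s*([A-Za-z ]+?)\s*:\s*(.+?)\s*$", text, flags=re.MULTILINE
    )
    return {name.strip().lower(): value for name, value in pairs}


def categorise(words):
    """Count how many fractured words fall under each taxonomy category."""
    counts = Counter(words)
    scores = {
        category: sum(counts[term] for term in terms)
        for category, terms in TAXONOMY.items()
    }
    # Keep only the categories that matched at least one word.
    return {category: score for category, score in scores.items() if score > 0}


sample = "Invoice number: 1042\nThe payment for the budget review is overdue."
words = fracture(sample)
values = named_values(sample)
categories = categorise(words)
```

The products listed earlier in the deck do far more than this (stemming, synonyms, concept maps); the point here is only to show how free text could end up as rows that an ordinary query tool can filter and count.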
Presenter
Presentation Notes
Durable Referencing
Here, "referencing" just means the URI for the object: a permanent, unbreakable locator for the object. When you put something in Truth, it becomes part of the truth; it is there to stay, and it can always be found. Note that in formal records management, "referencing" has a special meaning, which is not what we mean here.

Easy Sharing
Once you have a permanent URL for an object, sharing becomes easy. Pasting into emails, wiki pages and anywhere else is a cinch. Sharing URLs, however, is only the first step. Better wiki integration via plugins will make both products truly complementary. With dedicated applications built with Spring Surf for Alfresco sites, the sky is the limit.

Better Process (with positive side effect)
With the addition of the Activiti BPMN workflow engine, sites in Truth can automate business processes. Using a bit of custom development, we can offer extended services within Truth that, besides automation, also lead to a better business process.

Better Culture
This is the ultimate goal. This is not a piece of technology or an automated process. This is where our human nature, our work practices and the technology tools we use all converge to give a total greater than the sum of the parts. In an environment where the mundane bits are automated, the tools aid rather than hinder, and our ability to communicate is free and easy, we can be happier and more productive.
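The durable referencing idea in these notes can be sketched as follows. This is not how Truth or Alfresco is implemented; it is a minimal illustration, assuming the permanent link is built from an identifier that never changes rather than from the folder path. The class name, the base URL and the paths are all hypothetical.

```python
import uuid


class DocumentStore:
    """Toy registry: links are built from a fixed ID, never from the path."""

    BASE_URL = "https://truth.example.org/d/"  # hypothetical address

    def __init__(self):
        self._paths = {}  # document ID -> current storage path

    def add(self, path):
        """Register a document and return its permanent link."""
        doc_id = str(uuid.uuid4())
        self._paths[doc_id] = path
        return self.BASE_URL + doc_id

    def move(self, url, new_path):
        """Move or rename the document; the link itself does not change."""
        self._paths[self._doc_id(url)] = new_path

    def resolve(self, url):
        """Return the current storage path behind a permanent link."""
        return self._paths[self._doc_id(url)]

    @staticmethod
    def _doc_id(url):
        # The ID is the last segment of the link.
        return url.rsplit("/", 1)[-1]


store = DocumentStore()
link = store.add("/finance/2010/budget.pdf")
store.move(link, "/archive/finance/budget-2010.pdf")
current_path = store.resolve(link)
```

Because the link pasted into an email or wiki page never contains the folder path, reorganising the repository later does not break it, which is what makes the easy sharing described above possible.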
[Diagram labels: Surveys, Documents, Web, Regulatory, Records, Reports, Databases, Analytics, Actionable Insight]
If the information is available at the right place and the right time, then you must naturally have a better process, and this improves culture.