www.fit.qut.edu.au queensland university of technology fit school of information systems mm 1 cricos...
TRANSCRIPT
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 1 CRICOS No. 00213J
Maintenance
INFORMATION LIFE CYCLEcreate
distribute
use
maintain
recall
reuse
store
dispose
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 2 CRICOS No. 00213J
Subject analysis
Examination of a document to determine the subject content and describe it fully. It results in assigning:
• Subject heading(s) • Descriptor(s)• Classification codes
Carried out to facilitate subject searching
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 3 CRICOS No. 00213J
Aboutness
• What’s it all about?
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 4 CRICOS No. 00213J
SUBJECT CONTROL• Indexing and Classification
• The ‘Art’ of assigned indexing:– Empathy– Meticulousness– Consistency– General knowledge– Patience
• Knowledge-based system?
• Index term maintenance (metainformation)– classification scheme development– thesaurus control
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 5 CRICOS No. 00213J
Indexing alternatives• Assigned Indexing (indexer-assigned)
– Human problems• Cost• Inaccuracy• Inconsistency
• Derived Indexing (using terms existing in text)– Computer problems
• Empathy• General knowledge• Flexibility
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 6 CRICOS No. 00213J
Indexing Language
• A list of terms or notations that might be used as access points in an index
• The set of terms (the vocabulary) AND the devices for handling the relationship between them in a system for providing index descriptions.– Alphabetical indexing languages
– Classification schemes
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 7 CRICOS No. 00213J
Assigned indexing Principles
• Alphabetical vs Classified
• Pre-coordinate/Post-coordinate
• Semantics vs syntax
• Specificity vis generality
• Subdivisions
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 8 CRICOS No. 00213J
Alphabetic-specific/Alphabetic-classified
Specific
Direct entry
Amalthea 234
882Ganymede 882Moon (Earth's natural satellite) 109
ClassifiedIndirect entry
SatellitesNatural satellites Jupiter satellites
Amalthea 234882
Ganymede 882Io 78
545 Mars satellites
.
. Moon (Earth's natural satellite)
109
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 9 CRICOS No. 00213J
Battle of the cookbooks
EGGS 28.
Omelettes 30
Apricot soufflé omelette 32
Basic omelette 32
Cheese fluffy omelette 32
Cheese omelette 31
Dessert omelettes 32.
.
Sauces 36
Aurora sauce 38
Basic white sauce 36.
.
FISH
Fish44
Baked fish 48.
.
Shellfish 52
Crabs 54.
.
.
.
Coq au vin, 296
coriander, 276
Chicken Breast with Ham, Noodles and Coriander, 276
Tripe with coriander, 258
corn, 29
Beef and Corn Pie, 102
Chicken and Sweetcorn omelettes, 305
Corn Cooked with Red Pepper, 29
Soup of Pureed Corn with Noodles and Chicken, 14
courgettes see zucchini
crab, 59
A Fragrant Stew of the Sea, 68
Crab in Parsley Crepes, 59
crayfish, 60.
.
.
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 10
CRICOS No. 00213J
Coordination of descriptors
• Pre-coordinationConcepts are combined at time of assigned indexing
Chemistry -- Study and teaching (Secondary) (LCSH)
• Post-coordinationConcepts held separate at time of indexing on assumption that they will be
combined at time of retrieval
ChemistryScience instructionSecondary education (ERIC)
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 11
CRICOS No. 00213J
Semantics of assigned indexing• Categories of concept (from Aristotle to …)
• Reality (ontological) or expressions of reality
• Word variations– Nouns; singular vs plural– Homographs
• Relationships– Equivalence– Hierarchy– Affinitive/associative
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 12
CRICOS No. 00213J
Syntax of assigned indexing• Relational operators
– ‘and’ to show relationship exits– Operations such as functional dependence (Games played by Queensland),
(Farradane’s ‘analets’)
• Role indicators– 'demonstration', 'radiation', 'food', 'student', and 'cooking’
– BOTTLES; polyethylene, blow moulded (eg. From BTI)
• Combination order– Geographical photography or photography in geography
• Chaining– Ethnic groups–young people-ethnic identity-psychotherapy-cultural aspects
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 13
CRICOS No. 00213J
Specificity
Finding an expression for the complete content• Classification• Subject headings• The generality of the content
Finding an expression for individual parts• Indexing• Thesauri• Specificity
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 14
CRICOS No. 00213J
Subdivisions
• Subheadings that may be repeatedly used• Generally not used in thesauri (post-coordination
expected)• Standard subdivisions in classification schemes
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 15
CRICOS No. 00213J
Derived indexing principles
• Index files
• Computer-assisted/ Computer-based
• Vocabulary control
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 16
CRICOS No. 00213J
CLASSIFICATION
• Natural
• Artificial
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 17
CRICOS No. 00213J
Controlling the subject content
Controlled vocabularies
• Classification schemes
• Thesauri
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 18
CRICOS No. 00213J
CLASSIFICATION SCHEMES
• Document-based ACM, UDC, Colon, LC
• Role-based ASIC, ASCO, Patient
• Discipline-based Music, law....
• Internet applicationsYahoo, ACM, Dewey...
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 19
CRICOS No. 00213J
Composite subjects
• Waste Disposal
• Statistics on the health of Australians
• Agriculture as depicted in the paintings of the impressionists
• Computer-controlled mining operations
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 20
CRICOS No. 00213J
CLASSIFICATION METHODS
• Enumerative– Each subject present in the collection that it is
intended to classify has its own notation or term - all simple & all complex subjects are listed
• Faceted– Subjects are considered as simple Isolates, and a
facet which describes a subject is the sum of its isolates
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 21
CRICOS No. 00213J
Thesaurus extract
35 mm CAMERAS
BT MINIATURE CAMERAS
CAMERAS
BT OPTICAL EQUIPMENT
NT MOVING PICTURE CAMERAS
STEREO CAMERAS
STILL CAMERAS
UNDERWATER CAMERAS
RT PHOTOGRAPHY
CINE CAMERAS
BT MOVING PICTURE CAMERAS
NT UNDERWATER CINE CAMERAS
RT CINEMA
CINEMA
RT CINE CAMERAS
DIVING
RT UNDERWATER CAMERAS
INSTANT PICTURE CAMERAS
SN Cameras which produce a finished
print directly
BT STILL CAMERAS
Land cameras USE VIEW CAMERAS
MICROSCOPES
BT OPTICAL EQUIPMENT
MINIATURE CAMERAS
BT STILL CAMERAS
NT 35 mm CAMERAS
MOVING PICTURE CAMERAS
BT CAMERAS
NT CINE CAMERAS
TELEVISION CAMERAS
OPTICAL EQUIPMENT
NT CAMERAS
MICROSCOPES
PHOTOGRAPHY
RT CAMERAS
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 22
CRICOS No. 00213J
Thesaurus FeaturesA thesaurus is the vocabulary of a controlled indexing language
formally organised so that the a priori relationships between concepts are made explicit.
• Hierarchical structure • Syndetic Structure• Descriptor Construction• Term Display
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 23
CRICOS No. 00213J
Classification Scheme Features
• Notation• Structure
– Relative index to schedules
• Naturalness– Literary warrant
– Orientation
• Detail of classes• Subdivisions• Integrity of numbers
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 24
CRICOS No. 00213J
Knowledge-based systems
• Rule-based• Semantic net• Frame-based
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 25
CRICOS No. 00213J
Camera in a frame
Slot Permitted category Instance
Frame type tool camera
Part of tool optical equipment
Process operation take photograph
Agent person photographer
Time phase night
Place building toolshed
With tool tripod
www.fit.qut.edu.au
Queensland University of Technology FIT School of Information Systems MM 26
CRICOS No. 00213J
Summary
• Aboutness– Retrieval relevance
• Subject analysis– Assigned– Derived
• Control of subject content– Classification schemes– Thesauri