lightning flash on etds. data [re]cycle 1.candidate to proquest 2.proquest markup in xml 3.locally...
TRANSCRIPT
Lightning Flash on ETDs
Data [re]Cycle
1. Candidate to ProQuest 2. ProQuest markup in XML3. Locally transform XML for IRO 4. Convert to MARC for OCLC/InfoHawk
Original ProQuest data
XML after transformation
XSLT for bepress conversion
Stylesheet instructions to• Standardize capitalization• Add field for bibliographic references• Add note for optimized files• Control vocabulary for degrees
XSLT Conversion Files
Discpline, Department, Optimization
Thesis Advisor
Controlled Names and Vocabularies
Standard Character Set
Character Conflict
• Bepress (IRO)– Allows formatting (italics, bold, superscript)– Does not allow actual characters or name entities– Accepts only numeric entities
• OCLC – Does not allow formatting – Does not allow name entities – Allows numeric entities or the actual character
HTML file check
Upload to Bepress
Convert XML MARC/XML
• Title in 245• 2nd indicator = 3
when title begins with “An ”
• 2nd indicator = 4 when title begins with “The ”
MarcEdit
Export to OCLC