Subproject 4: HTML-WML Transcoding System
Jia-Shung Wang
Computer Science Department, National Tsing Hua University
March 27, 2001
Outline
• Motivation and Issues
• Examples of Transcoding
• System Overview and Translation Flow
• Some HTML to WML Conversion Strategies
Information Appliances
• Different design constraints based on intended use, which enhances ease of use
– Desktop PC
– Mobile PC
– Desktop “Smart” Phone
– Mobile Telephone
– Personal Digital Assistant
– Set-top Box
– Digital VCR
– …
• Implications:
– Shift from computer design to consumer design
– Heterogeneous “standards,” hybrid networking
– Interactive networking, access on demand, QoS
Motivation
• Rapidly growing diversity of wireless communication devices
• Rapid growth in the number of HTML web pages available on the Internet
• A solution is needed that lets mobile devices with WML browsers access the existing HTML or WML pages on the Internet.
Issues
• Device-enabled service for WML mobile devices with different types of screens
• Bandwidth-driven transmission for rapid response and fast delivery speed
• The usage of browsing behavior
• The resizing of images/icons
• The compression of the resulting WML data
Demos of Transcoding
• Contents from
– enYES 鉅亨網
– USAtoday
– CS, NTHU
– NTHU
– VOD
Discussions
• enYES provides two versions, regular HTML and WAP, to serve PC users and mobile device users separately.
• USAtoday also provides a simplified version of its content for Palm users.
• NTHU and CS-NTHU homepages: if we keep the original figure in order to preserve its link information, the page layout becomes odd (viewed with an HTML browser, Browse-It).
• VOD homepage (one-column text): no significant difference after transcoding.
Usage of Browsing Behavior
• Automatic translation is complicated by the diversity of content posted on an HTML page.
• A universal conversion strategy that translates every HTML page into sequences of WML decks effectively is unlikely to exist.
• However, it seems a good idea to first classify the HTML page to be translated according to categories of browsing behavior.
Usage of Browsing Behavior (cont’d)
• Once the page is classified, we can tell what the client requires. We can then apply a corresponding conversion that extracts the required content step by step and translates it into predictable, small-sized WML documents.
• We believe that, after classification, adequate conversions exist for at least some kinds of web pages.
Related Works: Transcoding Proxy of IBM alphaWorks
• Its goal is to manage different versions of content, with different fidelities and modalities, in order to adapt delivery to different client devices.
Related Works: Intel Quick Web Technology
• A software capability that helps Internet providers and digital distribution companies increase the delivery speed of Web pages containing photos, drawings, and other graphics.
• It uses two key techniques: compression and caching.
Related Works: Spyglass Prism
• Spyglass Prism dynamically adapts Web content to match various non-PC devices.
• It functions as a proxy server, caches the converted content, and dynamically converts standard HTML to WML.
Related Works: Proxy Architecture for Efficient Web Browsing over Cellular Networks
• Decreases the access time for browsing the WWW in a narrow-band wireless environment.
• It adopts persistent connections and pipelining, based on a proxy architecture, to improve HTTP processing between the client and the proxy server.
Comparisons between HTML and WML
• Both make use of tags and attributes.
• Similar character set, syntax, and data types.
• Two special elements of WML structure
– Deck and Card
• Different design goals
– HTML: to publish hypertext on the World Wide Web
– WML: for devices with narrow network bandwidth, small displays, limited memory, and fewer computational resources
Examples of HTML and WML

WML

    <wml>
      <!-- The <wml> element itself is the deck; WML defines no <deck> tag. -->
      <card>
        <p>
          <do type="accept">
            <go href="#card2"/>
          </do>
          This is the first card...
        </p>
      </card>
      <card id="card2">
        <p>
          This is the second card.
        </p>
      </card>
    </wml>

HTML

    <html>
      <head>
        <title>Example page.</title>
      </head>
      <body>
        <h1>This is a headline.</h1>
        <p>This is a paragraph.</p>
      </body>
    </html>
System Overview
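[Figure: system overview. A Web Server (HTML and WML documents, multimedia content, CGI scripts, etc.) is reached over HTTP by the Translation Server (HTML Parser, HTML-WML Translator, WML Generator), which serves WML to the Client (WML browser, etc.) over WAP or HTTP.]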
Features
• An HTML-WML Translator on the Translation Server
• Both HTTP and WAP requests are acceptable.
• Java Servlet API compatible
• Server- and platform-independent
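Translation Server: Components and Flow
[Figure: translation server components and flow. Requests and responses pass through the Network Protocol layer and the Proxy; fetched documents go through the HTML Parser, Document Analyzer, Filter, Decks & Cards, Link Builder, and WML Generator.]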
Components
• Gateway
– Accept requests from clients
– Return appropriate responses
• Proxy Servlet
– Get the requested remote documents
– Determine to pass or convert
– Cache the converted results
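
The three Proxy Servlet duties can be sketched as follows. This is a minimal Python illustration, not the project's Java servlet: `fetch` and `convert` are hypothetical callables supplied by the caller, and the rule "pass WML through unchanged, convert everything else" is an assumption about how the pass-or-convert decision is made.

```python
from typing import Callable, Dict, Tuple

WML_TYPE = "text/vnd.wap.wml"  # MIME type used for WML decks


class TranscodingProxy:
    """Hypothetical stand-in for the Proxy Servlet: fetch, pass or convert, cache."""

    def __init__(self,
                 fetch: Callable[[str], Tuple[str, str]],
                 convert: Callable[[str], str]) -> None:
        self._fetch = fetch      # url -> (content_type, body)
        self._convert = convert  # HTML text -> WML text
        self._cache: Dict[str, str] = {}

    def handle(self, url: str) -> str:
        # Serve a previously converted result without refetching.
        if url in self._cache:
            return self._cache[url]
        content_type, body = self._fetch(url)
        # Assumed rule: documents that are already WML are passed through.
        if content_type.startswith(WML_TYPE):
            return body
        wml = self._convert(body)
        self._cache[url] = wml   # only converted results are cached
        return wml
```

A real proxy would also need cache expiry and error handling, which are left out here.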
Components (cont’d)
• HTML Parser
– Parse the HTML document as a parse tree
• Document Analyzer
– Analyze the parse tree
• Filter
– Filter any objects unnecessary or not supported by the client device
– Image/icon resizing
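
The filtering step can be illustrated with Python's standard `html.parser` module. This is only a sketch under stated assumptions: the `KEEP` and `SKIP_CONTENT` sets are illustrative choices, not the project's actual filter rules, and image/icon resizing is not covered.

```python
from html import escape
from html.parser import HTMLParser

# Assumption: an illustrative whitelist of tags a WML client could display.
KEEP = {"p", "br", "b", "i", "u", "em", "strong", "a"}
# Assumption: elements dropped together with everything inside them.
SKIP_CONTENT = {"script", "style", "object", "applet"}


class TagFilter(HTMLParser):
    """Keep whitelisted tags and text; drop all other markup."""

    def __init__(self) -> None:
        super().__init__(convert_charrefs=True)
        self.out = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_CONTENT:
            self._skip_depth += 1
        elif self._skip_depth == 0 and tag in KEEP:
            if tag == "br":
                self.out.append("<br/>")
            elif tag == "a":
                href = dict(attrs).get("href") or ""
                self.out.append(f'<a href="{escape(href)}">')
            else:
                self.out.append(f"<{tag}>")

    def handle_endtag(self, tag):
        if tag in SKIP_CONTENT:
            self._skip_depth = max(0, self._skip_depth - 1)
        elif self._skip_depth == 0 and tag in KEEP and tag != "br":
            self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.out.append(escape(data, quote=False))


def filter_html(html: str) -> str:
    """Return the input with unsupported elements removed."""
    parser = TagFilter()
    parser.feed(html)
    parser.close()
    return "".join(parser.out)
```

Layout tags such as `<table>` are dropped while their text is kept. The output is only as well-formed as the input, so a real filter would run on tidied markup.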
Components (cont’d)
• Content Divider
– Split a document into multiple, small-size documents
• Link Maker
– Insert extra links to make small documents reach one another
• WML Generator
– Produce well-formed WML documents and return them to Proxy Servlet
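
The divide, link, and generate steps can be sketched in Python as below. The character budget `max_chars`, the greedy packing rule, the `Prev`/`Next` link labels, and the `<base_name><index>.wml` naming scheme are all assumptions made for illustration. The XML prolog and WML DOCTYPE that a real deck needs are omitted for brevity.

```python
from typing import List
from xml.sax.saxutils import escape


def divide(paragraphs: List[str], max_chars: int) -> List[List[str]]:
    """Content Divider: greedily pack paragraphs into chunks of at most
    max_chars characters (an oversized paragraph gets a chunk of its own)."""
    chunks: List[List[str]] = []
    current: List[str] = []
    size = 0
    for text in paragraphs:
        if current and size + len(text) > max_chars:
            chunks.append(current)
            current, size = [], 0
        current.append(text)
        size += len(text)
    if current:
        chunks.append(current)
    return chunks


def to_wml_decks(paragraphs: List[str], base_name: str,
                 max_chars: int = 500) -> List[str]:
    """Link Maker + WML Generator: one single-card deck per chunk,
    chained to its neighbours with Prev/Next links."""
    chunks = divide(paragraphs, max_chars)
    decks = []
    for i, chunk in enumerate(chunks):
        body = "".join(f"<p>{escape(text)}</p>" for text in chunk)
        links = []
        if i > 0:
            links.append(f'<a href="{base_name}{i - 1}.wml">Prev</a>')
        if i < len(chunks) - 1:
            links.append(f'<a href="{base_name}{i + 1}.wml">Next</a>')
        if links:
            body += "<p>" + " ".join(links) + "</p>"
        decks.append(f'<wml><card id="c{i}">{body}</card></wml>')
    return decks
```

For example, `to_wml_decks(["aaaa", "bbbb", "cccc"], "page", max_chars=8)` yields two decks: the first links forward to `page1.wml` and the second links back to `page0.wml`.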
HTML to WML Conversion Tools
• Semi-automatic:
– Used for rich HTML documents
– The conversion form is designated manually with the help of analysis and editing tools.
– The resulting forms are distributed to the gateway servers.
• Automatic:
– Used for simple documents, such as News and BBS, …
HTML to WML Conversion Strategies
• Strategy I: Tables to Lists
– Simply remove all layout elements such as tables
– Arrange all the contents into a single column with a fixed width
• Strategy II: One Table One Deck
– Extract each table to form a deck
HTML to WML Conversion Strategies (cont’d)
• Strategy III: Preview First
a. One Table One Deck
b. Collect the first card of every deck as a preview card
c. Arrange these preview cards to form a preview deck, which is transmitted first; every preview card has a link to its corresponding deck
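
The three strategies can be compared on a toy document with a short Python sketch. The data model is an assumption made for illustration: a document is a list of tables, each table an ordered list of content items, with one item per card. The `max_cards` cut in Strategy I is also an assumed reading of how the single column is split into decks.

```python
from typing import List, Tuple

Table = List[str]  # one table: ordered content items, one item per card
Deck = List[str]


def tables_to_lists(tables: List[Table], max_cards: int) -> List[Deck]:
    """Strategy I: drop the table layout, keep a single column of content,
    and start a new deck every max_cards cards."""
    flat = [item for table in tables for item in table]
    return [flat[i:i + max_cards] for i in range(0, len(flat), max_cards)]


def one_table_one_deck(tables: List[Table]) -> List[Deck]:
    """Strategy II: each non-empty table becomes its own deck."""
    return [list(table) for table in tables if table]


def preview_first(tables: List[Table]) -> Tuple[List[Tuple[str, int]], List[Deck]]:
    """Strategy III: build the decks as in Strategy II, then collect the first
    card of each deck into a preview deck. Each preview entry is
    (card, index of the deck it links to)."""
    decks = one_table_one_deck(tables)
    preview = [(deck[0], index) for index, deck in enumerate(decks)]
    return preview, decks
```

With four tables of 2, 5, 7, and 1 items and `max_cards=5`, Strategy I gives three decks of five cards each, Strategy II gives four decks, and Strategy III adds a four-card preview deck in front of them.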
Original Document
[Figure: an example <document> laid out with <table> elements and holding four sections. Section 1 has contents 1_1 and 1_2; section 2 has contents 2_1 to 2_5; section 3 has contents 3_1 to 3_7; section 4 has content 4_1.]
Tables to Lists
[Figure: the example document after Tables to Lists translation. The contents are placed in order into three decks: (1_1, 1_2, 2_1, 2_2, 2_3), (2_4, 2_5, 3_1, 3_2, 3_3), and (3_4, 3_5, 3_6, 3_7, 4_1).]
One Table One Deck
[Figure: the example document after One Table One Deck translation, with contents 1_1 to 4_1 regrouped into <deck> elements.]
Preview First
[Figure: the example document after Preview First translation, with contents 1_1 to 4_1 regrouped into <deck> elements and the first card of each deck serving as a preview card.]
Strategy Evaluation
• Assuming we have S sections in a document and the document is translated to N WML cards.
• Every deck contains at most C cards.
• Assuming that the contents in the same tables are similar.
Evaluation of Searching After Translation
| | Tables to Lists | One Table One Deck | Preview First |
|---|---|---|---|
| User Friendly | Worst | Best | Good |
| Average Deck Access Time | N/2 | S/2 | S/2C |
Performance Evaluation
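| | HTML Pages: Source, Headers (bytes) | HTML Pages: Source, Text (bytes) | HTML Pages: Images (bytes) | WML Decks (bytes) | Reduction, With Images | Reduction, Without Images |
|---|---|---|---|---|---|---|
| Experiment #1 | 24,359 | 9,471 | 176,361 | 7,440 | 3.5% | 22.0% |
| Experiment #2 | 17,937 | 6,137 | 126,740 | 11,232 | 7.4% | 46.7% |
| Experiment #3 | 21,203 | 8,325 | 280,727 | 16,891 | 5.4% | 57.2% |
| Experiment #4 | 9,568 | 20,363 | 17,966 | 12,062 | 25.2% | 40.3% |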
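
The slide does not state how the two "Reduction" percentages are computed. The figures are consistent with reading them as the size of the WML decks divided by the size of the original page, either counting images or not. The Python check below reproduces all eight percentages under that assumed reading; the byte counts are copied from the table.

```python
# (headers, text, images, wml) in bytes, copied from the performance table
experiments = {
    1: (24_359, 9_471, 176_361, 7_440),
    2: (17_937, 6_137, 126_740, 11_232),
    3: (21_203, 8_325, 280_727, 16_891),
    4: (9_568, 20_363, 17_966, 12_062),
}


def size_ratios(headers: int, text: int, images: int, wml: int):
    """Return (with_images, without_images): WML size as a percentage of the
    original page size. Assumed formula, inferred from the table's numbers."""
    source = headers + text
    with_images = round(100 * wml / (source + images), 1)
    without_images = round(100 * wml / source, 1)
    return with_images, without_images


for number, row in experiments.items():
    print(number, size_ratios(*row))
```

Under this reading a smaller percentage means a larger saving: in Experiment #1 the WML decks are about 22% of the HTML source, or 3.5% once the images are counted.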
Performance Evaluation (Experiment #1: What’s WAP)
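[Figure: deck layouts of the translated pages "What’s WAP" and "WAP Forum". Each layout starts with a Preview deck followed by numbered decks (Deck 1, Deck 2, Deck 3; in one layout Deck 3 is split into Deck 3.1 and Deck 3.2).]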
Performance Evaluation (Experiment #2: NTHU Web Page)
[Figure: deck layouts of the translated pages "NTHU", "Current Status", "History", and "About NTHU". Each layout starts with a Preview deck followed by numbered decks (Deck 1, Deck 2.1, Deck 2.2, Deck 3.1, Deck 3.2).]
Performance Evaluation (Experiment #3: NTHU CS Web Page)
[Figure: deck layouts of the translated pages "NTHU CS" and "Faculty". Each layout starts with a Preview deck and Deck 1; the remaining decks are numbered Deck 3.1 to Deck 3.6.]
Performance Evaluation (Experiment #4, IETF Web Page)
[Figure: deck layouts of the translated pages "IETF", "Internet-Drafts", "Internet-Drafts Index", and "DNSOP". Each layout starts with a Preview deck and Deck 1; the remaining decks are numbered Deck 2.1 to Deck 2.5.]
Implementation
• Goals: portability, reusability, and crash protection.
• Translation server: runs in a Java environment with Java Servlet, Java HTML Tidy, and XML Parser for Java.
• Servlet-enabled servers: Avenida Web Server and Nokia WAP Server
• Platform: Microsoft Windows NT Workstation 4.0 with Service Pack 5
Summary
• Design an HTML to WML transcoding system with
1. Analyzing and filtering HTML contents
2. Image/icon resizing
3. WML browsing mode design and WML conversion tool
4. Compression and decompression modules for the WML data
5. WML transmission control