patterns for personalization on the web

49
Semantic Patterns for Web Personalization Lora Aroyo [email protected] Web & Media Group Faculty of Computer Science VU University Amsterdam, The Netherlands http://www.cs.vu.nl/~laroyo twitter: @laroyo

Upload: lora-aroyo

Post on 27-Jan-2015

108 views

Category:

Technology


0 download

DESCRIPTION

Lora Aroyo, talk at http://www.ec.tuwien.ac.at/trends

TRANSCRIPT

Page 1: Patterns for Personalization on the Web

Semantic  Patterns  for  Web  Personalization  

Lora  Aroyo  [email protected]  

Web  &  Media  Group  Faculty  of  Computer  Science  

VU  University  Amsterdam,  The  Netherlands      

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 2: Patterns for Personalization on the Web

the  personalization  challenge  

•  discover  useful  linked  (open)  data  pa4erns    – domain-­‐specific  

– representa8on-­‐specific  – alignment-­‐based    

•  combine  seman8cs  with  user  context  

•  determine  user  relevance  and  ranking  

•  generate  meaningful  explana8ons  

•  select  suitable  presenta8on  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 3: Patterns for Personalization on the Web

Application  Domains    @  VU  Amsterdam  

Page 4: Patterns for Personalization on the Web

what’s  interesting  for  me  in  the  museum?  

Artwork  Recommendations  &  Personalized  museum  guide  

http://chip-­‐project.org  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 5: Patterns for Personalization on the Web

museum  metadata  &  vocabularies  

•  Metadata  format  is  Dublin-­‐Core  specializa8on  –  ARIA  database:  729  artworks;  47,329  triples  –  Adlib  database:  16,156  artworks;  400,405  triples  

•  Vocabularies  –  RM  Dic8onary  (#486),  RM  Encyclopaedia  (#690),  RM  Catalogue  (#43)  

–  Ge4y  TGN  (#425,517),  Ge4y  ULAN   (#1,896,936),  Ge4y  AAT(#1,249,162),  IconClass  (#  24349)  

•  (Manual)  Alignments  –  ~4000  alignts.:  ARIA  to  ~750  concepts  (Ge4y  and  IconClass)  –  (AdLib)  to  ~4500  concepts  (Ge4y)  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 6: Patterns for Personalization on the Web

enriched  rijksmuseum  collection  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 7: Patterns for Personalization on the Web
Page 8: Patterns for Personalization on the Web

what  can  we  do  with  semantics?  

•  Generate  automa8cally  (personalized)  tours  – adapt  tours  on  the  fly  – combine  spa8al,  temporal  &  seman8c  constraints  

•  Generate  automa8cally  recommenda3ons  – cluster  &  classify  – related  artworks  – related  art/history  concepts  – boost  the  ‘interes8ngness’  &  ‘serendipity’  factors  

•  Generate  automa8cally  explana3ons  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 9: Patterns for Personalization on the Web

semantic  recommendations  

Page 10: Patterns for Personalization on the Web

semantic  artwork  presentation  

Page 11: Patterns for Personalization on the Web

semantic  explanations  

Page 12: Patterns for Personalization on the Web

how  did  we  start  …  

WordNet  patterns  for  query  expansion  

Page 13: Patterns for Personalization on the Web

patterns  of  semantic  relations  in  WordNet  

•  Hollink,  et.  Al  (2007)  

Page 14: Patterns for Personalization on the Web

11  semantic  relationships  

• Wang,  et  al  (2009a,  2009b)  •  link  two  art  concepts  within  one  vocabulary  or  across  two  different  vocabularies,  e.g.  – Rembrandt  (ULAN)  –studentOf-­‐>  Pieter  Lastman  (ULAN)  – Rembrandt  (ULAN)  –hasStyle-­‐>  Baroque  (AAT)  

– Rembrandt  (ULAN)  –deathPlace-­‐>  Amsterdam  (TGN)  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 15: Patterns for Personalization on the Web

11  semantic  relationships  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 16: Patterns for Personalization on the Web

4  artwork  features  

•  link  an  artwork  &  its  associated  concepts    – The  Jewish  Bride  (Artwork)  –creator-­‐>  Rembrandt  (ULAN)  

– The  Jewish  Bride  (Artwork)  –crea3onSite-­‐>  Amsterdam  (TGN)  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 17: Patterns for Personalization on the Web

results  …  •  vra:creator  &  link:hasStyle  

&  aat:broader/narrower    –  most  accurate  

recommenda8ons  &  most  interes8ng  to  users  

•  ulan:birth/deathPlace  &  tgn:  broader/narrower  

–  have  the  least  values  for  accuracy  and  interes8ngness  

•  vra:subject  &  (subject)  skos:broader/narrower    

–  highest  recall  for  recommended  concepts  &  resulted  in  most  user  ra8ngs  

–  accuracy  and  interes8ngness,  they  score  average  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 18: Patterns for Personalization on the Web

navigation  patterns  •       artwork  -­‐>  creator  -­‐>  style  -­‐>  broader/narrower  styles  •       artwork  -­‐>  creator  -­‐>  teacher/student  -­‐>  styles  •       artwork  -­‐>  subject  -­‐>  broader/narrower  subjects  

artwork

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 19: Patterns for Personalization on the Web

what  to  watch  tonight?  

Personalized  Program  Guide  with  Social  Web  Activities  http://notube.tv    

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 20: Patterns for Personalization on the Web

deciding  what  to  watch  is  difficult  

http://www.cs.vu.nl/~laroyo

Page 21: Patterns for Personalization on the Web

Can  Linked  Data  Help?  can  linked  open  data  help?  

Page 22: Patterns for Personalization on the Web

first  we  …  

•  select  media-­‐related  Linked  Data  •  semantically  enrich  TV  program  metadata  •  define  similarity  measures  for  TV  programs  

•  semantic  content-­‐based  recommendations  

Page 23: Patterns for Personalization on the Web
Page 24: Patterns for Personalization on the Web

TV-­‐related  linked  data  

•  DBPedia,  Freebase,  WordNet(s)  •  TV  genre  typologies,  IMDB,  TV  Anytime,  BBC  Programme  ontology,  (constantly  growing  list)  

•  Expose  TV  metadata  as  Semantic  Web  data  •  Use  LOD  concepts  for  TV  metadata  enrichment  •  Publish  NoTube  additions  as  extension  to  LOD  •  Combine  and  align  Web  &  TV  standards  (public  broadcasters)  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 25: Patterns for Personalization on the Web

enrichment  of  TV  metadata  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 26: Patterns for Personalization on the Web

semantics  &  linked  data  @  BBC  

•  BBC  Programs  and  BBC  Music  ensure  ONE  page  per  programme  (ar8st)  with  RDF  representa8on  

•  BBC  Program  Ontology  

•  BBC  Wildlife  Finder  provides  a  URI  for  every  species,  habitat  and  adap8on  

•  The  BBC’s  World  Cup  site  uses  RDF  and  Linked  Data  for  a  site  of  700  aggrega8on  pages  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 27: Patterns for Personalization on the Web

many  interesting  facts  but  also  much  straight  forward  knowledge,    

e.g.  “Peter  Jackson  is  a  human  being”  is  necessary,  but  a  trivial  fact  from  a  user’s  perspective    

LOD  is  BIG  &  MESSY  

Page 28: Patterns for Personalization on the Web

source  for  noise  in  LOD  …  

• Multiple  (large)  vocabularies  with  various  semantics  

• Multiple  alignments  between  vocabularies  Content-­‐based  recommendations  with  a  wide  range  of  concepts  

•  Not  all  semantically  related  concepts  are  interesting  for  end  users  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 29: Patterns for Personalization on the Web

to  filter  out  the  noise  in  LOD  …  

we  look  for    patterns  in  LOD    

to  improve  performance  of  semantic  search  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 30: Patterns for Personalization on the Web

how  did  we  do  it  …  

•  select  the  appropriate  LOD  sources  – detect  representative  knowledge  patterns  

– Identify  pattern  types  –  higher  recall/similar  precision  • generic  patterns,  i.e.  hierarchical  &  associative  •  specific  patterns  -­‐  less  applicable,  but  rendering  better  performance  than  generic  patterns    

– enrich  the  data  according  to  those  patterns  •  extract  all  possible  pathway  patterns  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 31: Patterns for Personalization on the Web

method  

•  List  of  all  Properties  (P)  as  defined  in  their  vocabulary  (with  domain  and  range)  

•  P  Statistics  -­‐  #  triples  that  use  it,  universes  and  %  of  use  of  subject  &  object  types  

•  Align  P  to  top-­‐level  P  in  general  Content  ODPs  

– mappings  -­‐    owl:equivalentProperty,  rdfs:subPropertyOf  

•  Align  P  universes  to  top-­‐level  classes  in  ODPs  

•  Identify  paths  http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 32: Patterns for Personalization on the Web

paths  

•  ordered  list  of  properties  from  triple  sequences  that  instantiate  the  path  – a  length  (min  2)  =  #  properties  that  compose  it  – a  number  of  occurrences  =  #  of  its  instances  in  dataset  

•  Property  has    position  in  path,  subject  and  object  types  – linkedmdb:cinematographer, linkedmdb:performance, linkedmdb:film_character!

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 33: Patterns for Personalization on the Web

where  do  we  use  all  this  …  

for  recommendations  of  content  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 34: Patterns for Personalization on the Web

recommendations  with  patterns  

•  reduce  the  burden  of  too  much  choice  – filter  out  irrelevant  items  – push  relevant  background  items  – surface  programs  of  interest  in  the  ‘long  tail’  

•  support    – (interesting)  content  discovery  – serendipity  – knowledge  building  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 35: Patterns for Personalization on the Web

finding  interesting  relations  

•  deep  links    •  related  info  •  granularity  of  content    

– for  discussion  – for  user  feedback  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 36: Patterns for Personalization on the Web

distributed  context  

© danbri http://www.cs.vu.nl/~laroyo

Page 37: Patterns for Personalization on the Web

cross-­‐domain  recommendations  

•  domain  independent  content  patterns  

•  context  (in-­‐)dependency  

•  cross-­‐application  

•  cross-­‐domain  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 38: Patterns for Personalization on the Web

generating  explanations  •  Help  users  to:  

–  learn  the  recommendation  mechanisms  

–  understand  why  something  is  recommended  

–  quicker  share  recommended  content  

–  give  better  feedback  to  the  recommender  engine  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 39: Patterns for Personalization on the Web

relevance  to  the  user?  

© danbri http://www.cs.vu.nl/~laroyo

Page 40: Patterns for Personalization on the Web

next  we  …  

•  select  only  the  LOD  pa4erns  that  match  relevance  for  a  given  user  e.g.  using  the  user  profile  &  context  

•  find  rela8ons  between  a  user  and  program  – interes8ngness  factor  – serendipity  factor  – context  factor,  e.g.  8me,  loca8on,  device  

Page 41: Patterns for Personalization on the Web

FOAF  (Friend-­‐of-­‐a-­‐Friend)  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

User  Profile  schema:  capture  user  context  &  temporal  changes  

User  Modelling:    (Social)  Web  user  activity  &  user  preference  data  

Page 42: Patterns for Personalization on the Web

user  profiling  -­‐  activity  streams  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 43: Patterns for Personalization on the Web

NoTube  BeanCounter:  aggregating  &  profiling  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 44: Patterns for Personalization on the Web

patterns  in  social  media  

•  Twitter  TV  trends  in  people  I  follow  – what  my  friends  are  watching  – what's  most  popular  on  Twitter  right  now  – what  my  celebrities  are  liking  on  FB  

•  Hunch.com  links  between  content  &  people  stereotypes  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 45: Patterns for Personalization on the Web

NOTUBE  DEMONSTRATORS  

© Libby Miller, BBC

•  http://vimeo.com/10553773 •  http://vimeo.com/11232681

http://notube.tv

Page 46: Patterns for Personalization on the Web

NoTube  Demonstrator  I:  Personalized  Semantic  News  

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 47: Patterns for Personalization on the Web

OnlineTV  Guide   SeAop  Box  EPG  

NoTube  Demonstrator  II:  Personalized  EPG  &  Ads  

Mobile  Iden3ty  

•   ID  Anywhere  •   No3fica3ons      

•     Synchroniza3on  with  STB  •     Seman3c  Search  

•   My  TV  Night  •   What’s  on  for  me  •   Related  Programs  

http://ifanzy.nl

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 48: Patterns for Personalization on the Web

NoTube  Demonstrator  III:    Social  TV  &  Web  

•  http://vimeo.com/10553773 •  http://vimeo.com/11232681

http://www.cs.vu.nl/~laroyo twitter: @laroyo

Page 49: Patterns for Personalization on the Web

Acknowledgements  &    Image  Credits  

•  Libby  Miller,  BBC  •  Vicky  Buser,  BBC  •  Dan  Brickley,  VUA  •  Guus  Schreiber,  VUA  •  Natalia  Stash,  TUe  •  Yiwen  Wang,  TUe      •  Peter  Gorgels,  RMA  

•  http://pidgintech.com  •  Stoneroos  team  •  RAI  team  

http://www.cs.vu.nl/~laroyo twitter: @laroyo