[ieee 2009 seventh international workshop on content-based multimedia indexing (cbmi) - chania,...
TRANSCRIPT
Special Session: Multimedia Indexing for Content Based Search
Marco BrambillaPolitecnico di Milano
P.zza L. da Vinci 32 - 20133 Milano , [email protected]
Francesco Saverio NucciEngineering Ingegneria Informatica
Via San Martino della Battaglia 56 - 00185 [email protected]
Worldwide, the volume of stored information is growing
exponentially, and an increasing share is audiovisual con-
tent. This content drives the demand for new services, mak-
ing audiovisual search one of the major challenges for or-
ganisations and businesses today. Digital data is the greatest
value that many organisations possess, and the ability to use
it, rather than just store it, will be one of the most important
strategic aspects in the coming decade.
In this scenario, several research efforts focused on
studying advanced search architectures for enabling con-
sumers, businesses, and organisations to unlock the val-
ues found in audiovisual content through innovative access
paradigms. In particular, the focus of current researches is
on managing and enabling access to information sources
of all types, supporting advanced audiovisual processing
and content handling that will enhance control, creation,
and sharing of multimedia for all users in the value chain.
Several research projects financed by the European Com-
mission tackle this problem from different perspectives and
provide diverse visions for the future. This will impact in
the mid-long term on audiovisual industry, allowing com-
panies to provide more effective and efficient access to con-
tents thanks to innovative annotation techniques and search
paradigms.
The aim of this CBMI special session, organized with the
support of the PHAROS project1, is to offer an overview of
the research initiatives at European level that address the
problems related to processing, annotation, indexing, and
provisioning of contents within search applications. The
session includes nine peer-reviewed contributions, reported
in this volume, and an invited speech.
The invited speech, given by professor Stefano Ceri,
from Politecnico di Milano, Italy, will deliver a vision-
ary discussion on the topic of Search Computing, a novel
multi-disciplinary science which will provide the abstrac-
tions, foundations, methods, and tools required to answer
cross-domain search queries, that cannot be addressed by
1PHAROS (Platform for searcHing of Audiovisual Resources across
Online Spaces) Integrated Project (IST-2005-2.6.3) financed by the EC IST
6th Framework.
current search engines. A typical example of multi-domain
query is “Where can I attend an interesting Information Re-
trieval conference close to a sunny beach, with direct flight
connection to Europe and having a nice and cheap hotel
accomodation?”. The generality of the problem makes it
extremely relevant for the information retrieval community
and poses additional challenges to the field of multimedia
content annotation and indexing. The Search Computing
project is currently financed by the ERC under the IDEAS
Advanced Grants programme.
The other contributions to the session include a work by
Daras and Axenopoulos, that present a novel view-based
approach for 3D object retrieval, that exploits automatic
generation of a set of 2D images from a 3D object for cal-
culating a global shape similarity between two 3D models
that can support multimodal queries.
The work by Bozzon, Brambilla, and Fraternali dis-
cusses the use of a model-driven approach for specifying
multimedia indexing processes, verifying properties of in-
terest in such processes, and generating the code that or-
chestrates the components, so as to enable rapid prototyp-
ing of content analysis processes in presence of evolving
requirements.
Zidouni, Quafafou, and Glotin focus on the role of struc-
tures in named entity retrieval inside audio transcription.
They exploit this information, extracted through Condi-
tional Random Fields (CRFs), for deducing an optimal hier-
archical structure of the space of concepts(named entities),
which are represented by nodes or any sub-paths in the hi-
erarchy.
The paper by Schreer et al. reports on the experiences
achieved by the RUSHES EU research project in the field of
multimedia analysis and reuse of raw un-edited audiovisual
content, providing insights on automatic semantic annota-
tion, indexing and retrieval of raw content, as well as on the
main features of the final RUSHES search engine.
Ren and Jose present their work on exploitation of the
media document representation called feature terms to gen-
erate a query from multiple media examples, such as im-
ages. Their approach helps feature accumulation from mul-
2009 Seventh International Workshop on Content-Based Multimedia Indexing
978-0-7695-3662-0/09 $25.00 © 2009 IEEE
DOI 10.1109/CBMI.2009.54
113
tiple examples and enables the exploration of text-based
retrieval models for multimedia retrieval. Their work dis-
cusses three optimisation criteria for feature term selection
and two ranking functions for relevance estimation.
The paper by Dunker et al., from Fraunhofer IDMT, de-
scribes a solution for automated slideshow generation by
extracting a set of high-level features from music and in-
telligently combining this set with image high-level fea-
tures. The technique allows users to create personal-
ized slideshows intelligently combining music and pictures,
based on their features.
Georges Quenot et al. propose an approach based on the
International Phonetic Alphabet (IPA) for content-based in-
dexing and retrieval of multilingual audiovisual documents.
The approach includes the building of an IPA-based multi-
lingual acoustic model and a dynamic programming based
method for searching document segments by IPA string
spotting. The approach works even if the languages of the
document are unknown.
Cyril Laurier and the Music Technology Group from
Universitat Pompeu Fabra present a robust and efficient
technique for automatic music mood annotation. A songs
mood is expressed by a supervised machine learning ap-
proach based on musical features extracted from the raw
audio signal, that grants robustness to different audio com-
pression schemes.
Smeaton and Rothwell explore three questions related
to automatic detection of film highlights by measuring the
physiological responses of viewers of those films: whether
emotional highlights can be detected through viewer bio-
metrics; whether individuals watching a film in a group ex-
perience similar emotional reactions; and whether the pres-
ence of music in a film correlates with the occurrence of
emotional highlights. The results of this work could impact
the way in which movie content is indexed, e.g., by paying
special significance to movie segments that are likely to be
highlights.
The diverse contributions and skills of the participants
make the session an extremely interesting workbench for
open discussion and cross-fertilization between different
disciplines and across various research project consortia.
The contributions also give a hint on the creative and pro-
ductive area of multimedia search and indexing in the con-
text of the EU-funded research.
Marco BrambillaFrancesco S. Nucci
114