people’s preferences, perceptions and frequency of using
TRANSCRIPT
People’s preferences, perceptions and frequency of using search engines to search for images
A study submitted in partial fulfillment of the requirements for the degree of
Msc Information Systems Management
at
THE UNIVERSITY OF SHEFFIELD
by
Junyu Li
September 2013
2
Abstract
With the increase growth of image retrieval, it becomes significant to analysing people’s
perceptions on image retrieval. The frequency, perceptions and preferences of people searching
for images are very important to develop search engines. In this research, 84 participants
completed the questionnaire, and all of them are students or staff in the University of Sheffield.
The survey was designed to find out people’s behaviours on searching for images and which
search engine was the popular engine for image search. In the research, the most popular search
engine was Google and then Bing. On average, people would use search engines to search for
images 5 to 10 times per week, and search for job/academic needs 1 to 5 times per week. Most
people use search engine to search for images to prepare presentation, find out what something
looks like and travel destination. They carry out image search for curiosity, entertainment, gaining
knowledge, solving problems and job/academic need. 90.5% participants thought their last image
search was successful and the search engine was easy to operate. Only 9.5% participants thought
it was unsuccessful. Among unsuccessful participants, most people used Google search by image
function and thought it was hard to operate. In the end, participants thought search engine need
to return more relevant images than irrelevant images, improve the quality or size of the
returned images, provide related information about the image, and provide more search options
or improve the interface.
3
Acknowledge Firstly, I would like to express my appreciation for the helps offered by my dissertation
Supervisor Dr. Robert Villa, who always gives me great supports. He always encouraged me
and helped me build confidence and gain courage throughout the whole process of finishing the
dissertation.
Secondly, I would like to thank all of the participants who finished the questionnaire and help
me to finish this research, and I would like to thank my friends, because they give me enough
help to conduct this research and write this dissertation, especially they help me to carry out
pilot test successfully.
Finally, I would like to express my gratitude to my family, who always support me to complete this
research project.
4
Contents
Abstract ................................................................................................................................. 2
Acknowledge ......................................................................................................................... 3
1. Introduction ................................................................................................................... 7
1.1 Research aim ........................................................................................................................... 7
1.2 Objectives ................................................................................................................................ 8
Section 1: ................................................................................................................................... 8
Section 2: ................................................................................................................................... 8
Section 3: ................................................................................................................................... 8
1.3 Structure ................................................................................................................................. 8
2. Literature review .......................................................................................................... 10
2.1 Introduction .......................................................................................................................... 10
2.2 Image retrieval development ................................................................................................ 10
2.3 Types of image retrieval and limitation ................................................................................. 11
2.3.1 Text based image retrieval ............................................................................................. 11
2.3.2 Content based image retrieval ....................................................................................... 12
2.3.3 Limitations of image search ........................................................................................... 13
2.4 Image search behaviour ........................................................................................................ 13
2.4.1 Professional users’ search behaviours ........................................................................... 14
2.4.2 Search query affects peoples’ search behaviours .......................................................... 15
2.4.3 Similarity of images affects peoples’ search behaviours ................................................ 16
2.5 Conclusion ............................................................................................................................. 16
3. Methodology ................................................................................................................ 17
3.1 Introduction .......................................................................................................................... 17
3.2 Quantitative methodology .................................................................................................... 17
3.3 Qualitative methodology ...................................................................................................... 17
3.2 Critical incident Technique .................................................................................................... 18
3.3 Survey design ........................................................................................................................ 19
3.3.1 Questionnaire questions Q1-Q10................................................................................... 19
3.3.1 Questionnaire questions Q11-Q19................................................................................. 19
3.4 Implement ............................................................................................................................. 20
3.4.1 Participant ...................................................................................................................... 20
3.4.2 Ethical Aspects ............................................................................................................... 21
3.4.3 Pilot test ......................................................................................................................... 21
3.4.3 Questionnaire granting .................................................................................................. 22
4. Result ........................................................................................................................... 23
4.1 Demographic information Q1-Q4 ......................................................................................... 23
4.2 Perceptions regarding searching for images Q5-Q10 ............................................................ 24
5
4.2.1 Q5-Q7 frequency of searching for images ..................................................................... 24
4.2.2 Q8 the reason why carry out image search ................................................................... 25
4.2.3 Q9-Q10 which search engine is popular and why .......................................................... 26
4.3 Most recent experience of searching for images .................................................................. 28
4.3.1 Q11 when participant carried out the last image search. .............................................. 28
4.3.2 Q12-Q13 topics and reasons carrying out last image search ......................................... 28
4.3.3 Q14-Q16 search engine and difficulty & satisfaction of search engines ........................ 30
4.3.3 Q17-Q18 successful and unsuccessful with the last image search ................................ 32
4.3.3 Q19 aspects of search engine to improve ...................................................................... 34
5. Discussion .................................................................................................................... 36
6. Conclusion ................................................................................................................... 39
6.1 Summary ............................................................................................................................... 39
6.2 Limitation .............................................................................................................................. 40
6.3 Future study .......................................................................................................................... 40
Reference ............................................................................................................................ 41
Appendix A Questionnaire ................................................................................................... 45
Part One ...................................................................................................................................... 46
Part Two ...................................................................................................................................... 47
Part Three .................................................................................................................................... 49
Appendix B Ethics Proposal .................................................................................................. 53
Appendix C Ethics Information Consent ................................................................................ 59
Appendix D Approval Letter ................................................................................................. 61
6
Table Contents
Table 1 Q1-Q2 gender and age scale of people who participated in the survey .................. 23
Table 2 Q3-Q4 Education background and occupation of participants ................................ 24
Table 3 Q5-Q7 Three different frequencies of searching for images. Frequency 1 was on average image search frequency, 2 was search for images in job/study, and 3 was search for images outside job/study. 1-5 means 1-5 times per week, 5-10 means 5-10 times per week, 10-20 means 10–20 times per week, and >20 means more than 20 times per week. ............................................................................................................... 25
Table 4 Q8 Why carry out image search. The table shows the reasons why carried out image
search, and used Likert Scale to rank those reasons. In the scale, 1 to 5 were used, 1 means very unimportant, and 5 means very important. In the table, Range means how many numbers participants used to measure the reasons. ............................................ 26
Table 5 Q10 the ranking of reasons for choosing image search. In the table, reasons for
choosing search engines were listed, and used Likert Scale to rank those reasons. In the scale, 1 to 5 were used, 1 means very unimportant, and 5 means very important. In the table, Range means how many numbers participants used to measure the reasons. ... 27
Table 6 Q11 When was the Last time participants searched for images. ............................. 28
Table 7 Q12 Topics participants chose to search for images in their most recent image search
......................................................................................................................................... 29
Table 8 Q13 Reasons participants chose to search for images in their most recent image search .............................................................................................................................. 29
Table 9 Q12-Q13 correlation of topics and reason participant chose to carry out last image
search. In the table, Frequency was used to analysis the correlation of topics and reasons. ........................................................................................................................... 30
Table 10 Q14-Q16 How participants satisfied with the search engine, how difficulty they felt
to operate the search engines, and which search engine were used to carry out the most recent image search ............................................................................................... 32
Table 11 Q17 the possible reasons for participants having a success experience in image
search. In the table, Likert Scale was used to rank those reasons. In the scale, 1 to 5 were used, 1 means very unimportant, and 5 means very important. ........................... 33
Table 12 Q18 the possible reasons for participants having a failure experience in image
search. In the table, Likert Scale was used to rank those reasons. In the scale, 1 to 5 were used, 1 means very unimportant, and 5 means very important. ........................... 34
Table 13 Q19 In which aspects, search engines for image search can be improved (multiple
choices can be made) ...................................................................................................... 35
7
1. Introduction
As digital technologies have improved rapidly nowadays, the means of multimedia information
retrieve become various. Image retrieve become more significant for users, according to Yoon’s
research (2011), People search images for multiple utilizations, such as using for computer’s
wallpaper, demonstrated related items on slides, finding some unknown item’s pictures and
photos, etc. Image retrieve was not only based on text annotation, but also based on the content
of images. On one hand, image’s visual features can make itself delivered as either
complementary resources with text documents or independent resources. On the other hand,
the visual features of images generate different users’ needs and search behaviours, so it is
crucial to figure out user’s search behaviours and it can also help to improve the quality of image
retrieval. Image indexing and retrieval systems mostly rely on the computation of similarity
measures between images (Tirilly et al, 2012) and how easy to operate search engine would
conduct different search behaviour (Zhang, 2006). Therefore, those factors would also need to
consider when studying on people’s behaviour on image retrieval.
1.1 Research aim
This study aims to investigate how people use image search engines, people’s behaviours on
using image retrieval systems and why people succeed or fail searching for images. In this
research, several questions will be investigated, and overall aim of this research is to find out the
reason why people use image search engine and how people behave when searching for images.
Firstly, investigate the frequency of people using of image search engines and which search
engine is people’s favourite one and why. Then investigate people’s behaviour through image
retrieval, such as which is the way people used to search image, what their perceptions after
search engine giving the results, are whether they satisfied with those search results, if the
results are not satisfied their need, is the reason would be similarity between the search results
or the operation difficulty of search engines, or other unexpected seasons. The aims to
investigate people’s behaviour through image search would be important for building image
search system. The similarity of images could influence people’s need and the degree of
satisfaction with the results after searching certain types of image. University students may be a
specific group in searching image. In assumption, they would use image retrieval both in personal
way and academic way. This research will focus on the group of students in the University of
Sheffield to study their behaviours on image retrieval and to prove above assumptions. All the
questionnaires and Likert Scales are anonymous, and will be accessed by online channel. The
participants are mainly focus on the students of University of Sheffield. Questionnaires and Likert
8
Scales are sent by email system in the University of Sheffield. Overall practicalities of this
research are with less ethical risks because recruiting people online is relatively easy and safe,
and the number of participant is only 100 more or less in this research which is not too large to
investigate.
1.2 Objectives
Section 1: To investigate why people use search engine to search for images and the
frequency of using it.
RQ 1.1 How often do people use search engine to search for images and why do they carry out
image search?
RQ 1.2 Which search engine to search for images do people usually use and which search engine
is most popular with people and why?
Section 2: To investigate people’s behaviours on image retrieval.
RQ 2.1 How easy do people feel about operating search engines to search for images?
RQ 2.2 How happy do people feel about the search results?
Section 3: To figure out why people success or fail in image searching.
RQ 3.1 Do people feel successful when conducting an image search, and why they feel
successful?
RQ 3.2 Do people feel failure when conducting an image search, and why they feel
unsuccessful?
RQ 3.3 Is there any aspects of search engines need to improve?
1.3 Structure
In this dissertation, six chapters would be used to explain this research. Firstly, Chapter one is
introduction, in this section, the background, aims, objectives and relevant concepts of this
research will be expressed. Secondly, literature review will be conveyed in Chapter two, in which
critically reviews will contain several areas, such as development of image search, types of image
search, and people’s behaviours on searching for images. After that, Chapter three will address
the methodology applied in this research, which includes the appropriate methodology to collect
data, the way to design the questionnaire, and the implement of those research questions.
Besides, in Chapter four, results of this research after data collecting will be described, and the
9
appropriate tables and charts will be used to display the questionnaire data of the research.
Fifthly, Chapter five is discussion part, in which significant and interesting findings will be
discussed, and the main points about data also need to analysis. Finally, the last Chapter is
conclusion. In this section, summary of this research project, limitation of this research and some
recommendations of the future work will be discussed.
10
2. Literature review
2.1 Introduction
Image retrieval is a sort of data search, but specialized for image search. To search for images,
users need query term, such as keyword, image file uploaded, or click on search condition. After
that image retrieval systems would return images matched users’ queries. With the rapid
development of computing technologies, all aspects indicate the growth of image search. As the
increased needs of image retrieves, and the difficulties of images annotations, the queries of
images described by users would be a problem for image search (Chen & Rasmussen, 1999). The
famous semantic gap in image retrieves is the discrepancy between the query a user ideally
would and the one it actually could submit to an information retrieval system. Therefore, to rise
up the precise of image index or to reach an agreement on the content search system returned
and the images users required became difficult. Image annotation and search models building
draw lots of interest of researchers, and recently there are increasingly researchers focus on
image retrieval (Rui et al, 1999).
Although, some possible improvements of image search are still imperfect, researchers in
different fields develop different types of image retrieves. One is content-based image retrieval
and another is text-based image retrieval, researchers use both of them to find ‘similar’ images
to match users’ queries. However, researchers can not only develop the methods of image
search, but also understand users’ underlying questions of images access, for example: how
images are described and what information should be offered to users (Chen & Rasmussen,
1999). Besides, similar images are defined as images of the same object or scene, but viewed
different by users under different conditions (Jegou et al, 2008). If researchers would improve the
image retrieval systems, they may focus on users’ underlying requirements through image search.
Therefore, using creative and useful way may solve those problems, but to understand users’
needs and search behaviours would be important to improve image retrieval systems.
2.2 Image retrieval development
According to Yoon (2011), images can not only be delivered directly from context as
complementary resources of text documents, but also can be searched as primary resources with
distinctive needs of users. As a result of popular use of image search, large search engine
companies like Google and Yahoo! built their own image search library and add this function to
their search engine (Zhang et al, 2006). In commercial way, Google built the largest image library,
11
so that users can use Google’s image search engine by typing query term, and Google would
match key words with textual annotation to return the ‘similar’ images (Fergus et al 2005). While
for Yahoo!, the same way was applied to its image search function, and this made Yahoo! become
the strongest competitors to Google in the world (Zhang et al, 2006). Recently, new website like
Flickr, which focus only on social network sites’ photos uploading, develops rapidly because of
the huge amount of image needs required by customers in this area. Although, website like Flickr
own millions of images in its database, but it is uneasy for customer to find out the exact one
which they ‘just know’ existed in those search engine database (Neal, 2009). Customer may be
still unsatisfied with those search engines to meet their distinctive needs. In Rieh and Hilligoss
research (2007), credibility would be concerned mostly in image retrieval, because of its effort
level and strategies which people applied on information seeking. However, for college students,
although credibility was one important factor to considerate during information seeking, they
often compromised information credibility for speed and convenience, especially when the
information retrieval was less important. The same way maybe also apply on the image retrieval
that people would not so critically need the credibility when seeking some less consequential
images. Therefore, this would be the reason why people choose to use Google or Yahoo! instead
of professional image retrieval engine, such as Gettyimages, Tineye or Iamgeseek websites.
2.3 Types of image retrieval and limitation
The technologies of image retrievals have developed rapidly in past 20 years, since 1990’s. This
increased search methods offer new ways to solve the semantic gap of images. Although the
problem is still there, users’ needs are gradually satisfied. There are two major search methods.
One is the traditional concept-based indexing, which searches for images by using both daily and
professional words and terms as query conditions, or describing what an images is or what it is
about in natural language. Another is newly developed techniques called content-based image
retrieval, which is rely on the analysis of image content through pixel-level of images (Chen &
Rasmussen, 1999).
2.3.1 Text based image retrieval
Usually, people use image search engine with composing a text query and executes this query on
intermediary image databases (Villa et al, 2010). The traditional concept-based or text-based
indexing is widely used in commercial search engines. The method called textual annotations for
image retrieval means adding a note or text manually to an image so that system can match
user’s query with those note. It may employ keywords, subject headings, captions, or natural
language text to annotate images (Chen and Rasmussen, 1999). According to Villa’s research
12
(2011), this is the standard method of online image seeking, and it is applied widely for online
image involved websites such as Flickr. This type of image search required a large set of human
labelled annotation. In 2001, Jorgensen evaluating a conceptual structure for image content
depiction. They categorise visual content into a “pyramid” with 4 syntactic levels
(type/technique, global distribution, local structure, composition) and 6 semantic levels (generic,
specific, and abstract levels of both object and scene, respectively). This experiment indicated
that the visual content structure “pyramid” is theoretically robust, and it can be used to guide the
indexing process and classify the image labels manually and automatically. It is a systematic
category methodology for image indexing.
Although textual annotation of image retrieval is employed commonly, manual annotations are
expensive and waste lots of time and money. Sometimes, web-based image search engines are
always mismatch the annotation with the content of images, the result of querying for those
mismatch images would cluttered with irrelevant data. Furthermore, it is not clear how to select
images through noisy data sets, such as mislabel (Ben-Haim et al, 2006). Therefore, scholars try
to find out if it is possible to automatically annotate images and multimedia, or search images
without annotations. In Villa’s research (2010), they found an algorithm to allow users to search
for images through an intermediate database, which means a user can use query term to find out
a visual example from intermediate database, and then example would be used to search for the
images lack of annotation. This method can solve the problem stated before, when an image is
lack of label or mislabelled, it can still be found in this way.
2.3.2 Content based image retrieval
Furthermore there is another way for image search, which called content based image retrieval
(CBIR). It is for users to query through uploading image files and match similar images as search
result, and this way is also used for image search system (Smeulder et al, 2000). Content-based
image retrieval required computing the appropriate image decomposition and index, and storing
this extracted information for searches based on image content (Sclaroff 1997). This technique
need to build based on a huge image library to analyse multimedia content, however, thanks to
the development of inexpensive hardware and software for image acquisition, storage and
distribution, this encourage an increasing volume of digital images unloaded online to foster the
growth for CBIR (Acharya, 2011). In Acharya’s research they also mentioned that CBIR search
engines can translate users’ query image and search the similar images from the database in
some extent. Nevertheless, CBIR always forced user to describe their requirements with
uploading example images, it would be difficult to find result, if no visual example of the
information need is available (Villa et al, 2010). In order to solve the problem of users uploading
13
picture as query terms, many research tried to develop new CBIR models. In Ben-Haim et al
research (2006) they propose an approach named Re-ranking sets of pictures by Exploiting
Consistency. This method is a hybrid of content-based retrieval and text-based retrieval. In this
method, users can retrieve with keyword and then clusters the results based on extracted image
feature. Another new model about content-based image search is called TSI-pLSA, developed by
Fergus et al in 2005. This method can handle the high intra-class variability and large proportion
of unrelated images returned by search engines.
2.3.3 Limitations of image search
General to say, there exist some problems related to image search, such as irrelevant images,
unclear images and unlabelled images (Ben-Haim et al, 2006). Furthermore user will find difficult
in obtaining the needed images, and developer will find difficult in images annotated
automatically (Covey 2002), or user needs are not taken into account appropriately (Burger
2010). Although Some scholar improved the image search system to handle the high intra-class
variability and large proportion of unrelated images returned by search engines (Fergus et al
2005), it is still difficult to meet individual users requirement according to their search
behaviours, as they might have different image needs in image search (Jorgensen and Jorgensen,
2005). In that way, there are still some limitations of image search, no matter using textual
annotation retrieval or content-based retrieval. Based on these limitations of image retrieval,
some researches were conducted. In Zhang et al study (2006), they root their hypothesis into
users’ behaviours, that customers would be easier to select terms generated by the image search
system rather than describing their target images by typing queries or uploading example images.
Therefore, they developed a method called interactive text-based image retrieval. Also new
technology such as organize search results (Rodden et al, 2001) and automatically annotate
images (Kennedy et al, 2009) should be used to increase the veracity of image search to meet
customers’ image seeking needs.
2.4 Image search behaviour
According to Zhang et al research (2006) about a commercial search engine queries logs, it shows
that more than 20% of image search queries can be categorized as location, nature, and daily life
related requirement. Therefore, image retrieval developed rapidly related to customers’ daily
lives and distinctive needs. Different individuals have different requirements in image retrieval
and the main challenge is an image can convey different meanings for users (Shatford-Layne,
1994). In order to improve image search veracity, researchers cannot only focus on the methods
and algorithms, but study users’ search behaviours as well. Therefore, in order to meet
14
customers’ needs in image search, there are some aspects should be clarified. According to the
book Image and video retrieval, image search behaviour is a complicated interaction involves a
lot of factors, one method to search individuals’ behaviour is that examine and analyse the
transition made by individual user through one search query to another (Goodrum et al 2003).
However, this might be too complicated to study every single user’s search behaviour. According
to Jorgensen and Jorgensen (2005), because there are mass of factors relevant with image
content, images display and images search results would be categorized differently by their
qualities, and users’ visual perception might be various depending on those factors. Basically,
those factors could be reflected directly from image content, such as the size and colour of
images (Jorgensen 2005), therefore those factors are crucial for image retrieval. Furthermore,
search moves, traits and strategies which employed by users are also crucial when studying users’
behaviors through image seeking (Goodrum et al, 2003). As people usually consider some basic
factors when search images, for example images’ colour, sharp, hue, size and mood, those factors
might influence the similarity between image contents and individual’s expectation. Therefore, it
is necessary for developers build image retrieval system to consider those factors which influence
people’s behaviours on image searching (Burger, 2010).
2.4.1 Professional users’ search behaviours
Most recent researches are focus on professional users’ search behaviours. In Burger’s report
(2010), there are six factors affect reuse multimedia search, including images. For those arts
professionals who want to reuse media objects, they use keywords or classification information
in high frequency. Among six factors, Practices and feedback, visual and compositional features
are most important factors affecting search behaviours. Another search (Westman et al, 2008)
indicated when searching for an image, journalism professionals and non-professionals would be
affected by task type. Users always choose to combine content-based and textual search models
together to find visually tasks. On the other hand, conceptual tasks only required large number of
text queries and categories. Therefore, different classified image tasks need different search
methods. In another research, Fukumoto (2006) observed and tested undergraduate students’
search behaviours that were given certain image retrieval tasks. A popular search strategy was
found in this research: At first, student uses one or two keywords as query term to search for
images, and then the returned images were viewed. Finally the “back to home” or “previous
page” button was used to begin another search. According to this result, the author proposed a
new interface as “quick-browse” for an image retrieval system.
15
2.4.2 Search query affects peoples’ search behaviours
Choi (2010) analysed users’ queries variation for image searches on the web. In the article,
study’s finding showed that query iterations varied with different task goals, job background, and
search expertise. Furthermore, the length of query would also vary with different image contents
and correlated with search engine operations. Jorgensen and Jorgensen (2005) reported an
analysis of search logs from a commercial image search web. In their report, users’ query
modification strategies were analysed and they made a conclusion that Boolean searching
seemed convenient but turn out ineffective and lead to query modification. Besides, descriptive
and thematic queries are more and more applied as query terms in image search retrievals. In
Pu’s research (2008) the failure of queries chose for web image search was analysed. The report
showed that queries would fail if it had high specificity or expertise, especially for some emotion
involved responses or abstract and conceptual refinements that required intellectual
interpretation (e.g., topic, genre, geographic, role, story, event, emotive, and abstract). In Chung
and Yoon’s research (2009), they analysed the characteristics of image search queries, especially
compared image attributes (abstract, generic, and specific) with basic metadata. The results
demonstrated that generic category appeared most frequently, users would use specific and
abstract attributes to search for images less frequently than generic attribute, and the colour
category was used least frequently.
Eakins, Briggs, and Burford (2004) made an experiment of images retrievals users, tried to find
out the requirements of users when search for images. The results demonstrated that
participants with high level of experience of searching for images would like to show high level
demand queries as search strategies rather than lower level features. In addition, the participants
were more likely to use keywords as query terms rather than menu choice or uploading example
image to search for an image. As the result, the authors made a conclusion that current CBIR
systems are inadequate, especially the interfaces part, which cannot satisfy user needs when
users cannot use keyword without example to search for images. Therefore it is the time to
develop new query paradigms. Yoon (2011) analysed user behaviour as the proliferation of
images in daily life. In this research, most participant chose Google or Google Image as search
engines to search for images, because they are familiar to users and have a user-friendly
interface. Although study finding indicated a high success rate of image retrieval, users still
wanted search engine could return more relevant images rather than irrelevant. Furthermore,
the search also found that student search images for various purposes, such as wallpaper of PC,
prepare for presentations, travel destinations and so on. This article analyse people’s needs of
image retrieval in daily life, but does not analyse how people operated image search engines.
16
2.4.3 Similarity of images affects peoples’ search behaviours
In order to understand people’s behaviour on image search, it must be rooted in the
understanding of factors which influence peoples’ action through seeking images. In this way,
some scholar became to think the image’s similarity, whether it is a factor that influences
people’s search strategies, and if it would help to understand why the unrelated image search
result ratio is high at some extent. Little is known about how humans assess the similarity in the
case of complex images (Tirilly et al, 2012). However, one definition of two images similarity can
be found that if they are images of the same object or scene viewed under different imaging
conditions, those two images are similar (Jegou et al, 2008), or if they describe objects or scenes
from the same category and scale, those two images are similar (Fergus et al, 2005). So that if
customer find unsatisfied with the image search results, it would be necessary to check if the
concept of similarity is common to the different subjects of the search engine (Jorgensen et al
2001). To respect to an image query with low unrelated results, measures of image similarity are
needed to apply (Jegou et al, 2008), image retrieval systems aim at providing relevant images to
their users, but are actually mostly relying on the notion of image similarity (Tirilly et al, 2012).
Indeed, measures of image similarity are important to use in image retrieval system in order to
respect to an image query.
2.5 Conclusion
In conclusion, these sources provide a complete background about image retrievals, the types
and paradigms of image retrievals and some professionals and non-professionals behaviours of
image retrieval, and the potential issues of image retrievals. Furthermore, it can be concluded
that most researchers focus on professional users’ search strategies of image retrieval, but for a
commercial search website, they may be more interested in normal users’ needs of image
retrievals. Therefore, it is important to carry out this research. Users’ backgrounds and habits
may affect their search strategies, and some image search behaviour may originate from a user’s
daily life, job task, or academic life. These underlying factors affecting search strategies have not
been analysed yet. Based on above articles and reports, normal people’s search strategies should
be analysed to find out the way how to improve image search paradigms.
17
3. Methodology
3.1 Introduction
In this research, quantitative methodology will be mainly applied to make a survey of
questionnaire, and some questions would use qualitative methods to analysis. In the survey, the
quantitative questions and Likert Scale (Hodge, 2005) will be applied to investigate people’s use
of image search in daily life and academic life. Critical Incident Technique (Flanagan, 1954) will be
also applied in this research, to investigate users’ most recently experience of searching for
images. All methodologies are used to understand people’s experience of carrying out image
searches, including people’s preferences, perceptions and frequency of use of image search
engines.
3.2 Quantitative methodology
During the late 19th and throughout 20th century, quantitative researches have been used to
study in many fields, such as psychology, economics, sociology, history, physics etc. Among those
quantitative strategies of search, these are survey, experiments, the less rigorous experiments
called quasi-experiments, correlational studies (Campbell & Stanley, 1963) and specific single-
subject experiments (Neuman & McCormick, 1995), which invoked positivist world view. This
research uses survey as a main methodology because it studies a sample of participants with a
series of questions and provides a quantitative or numeric description of trends, attitudes, or
opinions of a certain population. In this research, the data of people’s behaviours of image search
cannot be collected from every user of search engines; therefore, study a sample of people
becomes important for this research. Survey is just the way to study a sample of people, so it will
be used as a main methodology in this research. Besides, survey methodology applies principles
to the sample participants in the following steps. Firstly, design the relevant questions, and then
collect the data from participant as analysing sample, finally, process data and analysis data.
Although survey methodology is a quantitative technique, it still has errors and bias when
analysing statistics.
3.3 Qualitative methodology
In order to cover the errors or bias of survey, some qualitative method should be used. In
qualitative research, it tends to collect data directly from participants in the field of studying
issues. In the survey, in order to study the user’ perception of image search, some qualitative
18
questions would be asked directly to investigate the objects RQ1.1 and RQ1.2 (seen in the
Introduction Chapter). Qualitative method observes or interviews participants directly and
studies participant experiences about the issue or problem, therefore, it will be fit for
investigating questions related to people’s perception. Furthermore, Qualitative method is not
similar as quantitative one. Participants do not need to fill questionnaire forms or follow
instruments of experiments. Mostly participants would interact with the researchers to collect
data (Creswell, 2009). Using qualitative method as early stage analysis with literature reviews to
make sure the research issue will help to carry out questionnaire survey as main methodology
and can cover some bias of quantitative method.
3.2 Critical incident Technique
Another method called the Critical Incident Technique (CIT) is used in this research to assist with
quantitative method. CIT studies human behaviours by asking them describing a set of
procedures in a certain “incident” (Flanagan, 1954). Critical incidents can be gathered in several
ways, normally participants are asked to describe their experience about the “incident”. The
means of descriptions can be writing down their experience in their own words, recording their
interviews or answering some certain questions of the “incident”. In my research, the CIT is used
as part of the questionnaire with several questions, to investigate RQ3.1, RQ3.2 and RQ3.3 (seen
in the Introduction Chapter). Those research questions are mainly about people’s most recent
experience of search for images and the experience would be a kind of incident at some extent. A
critical incident can make a significant contribution to the search issue. These questions are then
kept track of the “incidents” and used the “incident” data to solve relevant problems. Therefore,
CIT will be useful to investigate people’s behaviours on the most recent image search, and also
helpful to analysis underlying “incidents” of searching for images.
Recognising that all methodologies are imperfect and have limitations, I combined quantitative
and qualitative methods together to reduce bias as many as possible in any single method, and
wish to neutralize or cancel the biases of other methods. Mixed methods strategies are not
famous as quantitative or qualitative methodology but more practical in real life researches. In
1959, Campbell and Fiske developed the new term called multimethods to describe mixing
different methods together. They used multimethods to research in psychology field, especially
for the validity of psychological traits. After the success of utilizing their mixing methods, they
encourage other researchers to employ their multi-method matrix to study whether multi-
methods can be used to collect data in other fields. Therefore, I used quantitative method to
make questionnaire and combine scale, critical incident methodology together to carry out this
19
research. Furthermore, using mixed methods can also helped to analysis different data collected
from samples.
3.3 Survey design
3.3.1 Questionnaire questions Q1-Q10
In this research, several questions were investigated, and overall aim of this research is to find
out the reason why people use search engine to search for images and how people behave when
using it. In the introduction Chapter, 3 main sections and 7 research questions of object were
designed in questionnaire with 19 questions. Besides, the questionnaire was designed based on
Yoon’s research called searching images in daily life. In his research, he focused on people using
search engines to search for images in daily life, and how they respond when searching for
images.
Firstly, the questionnaire was split into three parts. Part 1 included Q1-Q4 (Seen in Appendix A),
and those four questions were used to investigate the participants’ demographic, including age,
gender and their education background. In order to analysis simply, all the questions in this part
are close questions and design based on general survey. Then, in part 2, there are 6 questions.
The question number is Q5-Q10. (Q5. On average, how often do you use search engines to search
for images? Q6. How many times per week do you use search engine to search for images in your
job or study? Q7. How many times per week do you use search engine to search for images
OUTSIDE of your job or study?) Those questions were designed based the RQ 1.1 to investigate
participants’ frequency of using search engines to search for images. After that, Q8 (How
important is image search to you for the following tasks?) was designed to investigate the reason
why people used search engines based on RQ 1.1. Q9 (Which search engines do you normally use
to search for images?) and Q10 (Which of the following are most important to you when
conducing image searchers?) were designed based on RQ 1.2 to investigate the most popular
search engine and the reasons. Finally, an open question is offered for participant to list more
reasons. In those two parts, Q1 to Q10 were mainly designed to solve RQ 1.1 and RQ 1.2.
3.3.1 Questionnaire questions Q11-Q19
In part 3, Critical Incident Technique and Likert Scale are applied to investigate participants’ most
recent experience of searching for an image. Q11 (When was your most recent search for an
image) was designed to begin the investigation of search “incident” and this is a precondition to
investigate participant’s “incident”. Q12 (Why did you carry out this search) and Q13 (What was
20
the topics on which you searched?) were asked the reason that they carried out this search and
what kind of topic of image did them search for. Another key issue need to investigate is which
search engine they used in their most recent image search (Q14 what was the search engine you
used for this search) and how easy they operated this search engines (Q15 How difficult was this
search, to find relevant images?). As reasons why people use image search engines would be a
personal question various with individual, utilizing Likert Scale can measure this perception in a
quantitative traits. Likert Scale is a quantitative methodology to measure people’s perception of
using image retrieval, so this method is used to ask about how people feel when operating search
engines. Besides, Q16 was designed to find participants felt successful or not for this image
search. If they felt successful, they should rank the scale of the reasons in Q17 (Please rate the
following statements, if you felt you were not successful in your image search (Jump to Q19)),
and if they felt unsuccessful, they also need to rank the reasons in Q18 (18. Please rate the
following statements, if you felt you were successful in your image search) of the questionnaire.
Finally Q19 (Please select ways the image search engine you used can be improved) was designed
to know which aspects of search engines should be improved and it will helpful for the future
study in this area. Questions in part 3 were mainly designed based on the RQ 2.1, 2.2, 3.1, 3.2,
3.3. Those questions were focus on their perceptions after search engine giving the results. The
results of this questionnaire would be important for building image search system. The similarity
of images could influence people’s need and the degree of satisfaction with the results after
searching certain types of image.
3.4 Practical
3.4.1 Participant
In order to carry out this research, the staff and students of the University of Sheffield became
the underlying participants. University students and staff may be a specific group in searching
image. In assumption, they would use image retrieval both in personal way and academic way.
This research will focus on the group of students and staff in the University of Sheffield to study
their behaviour on image retrieval and to prove above assumption. As a student or staff in the
university, they may need to search for images in different way and for different needs, especially
they can search for both job/academic needs and normal life needs. However, there will be some
special reasons of searching for images listed by students or staff in the university, and those
reasons may not be widely accepted. Nevertheless, choosing staff and students in the university
as participants can fit for this questionnaire, especially for investigating the academic/job needs
of image search.
21
3.4.2 Ethical Aspects
In this research, no sensitive or confidentiality questions are contained. However, it directly
involves people in the research activities. As research involves human participant, ethical
problems are inevitable (Silverman, 2011). In this case, questionnaire and scale do not contain
sensitive questions, so the research would be low-risk. Furthermore letter of approval (Seen in
Appendix D) will be attached with questionnaire sent by email, and all the research questions are
anonymous. Every participant will be informed the purpose of this research (Seen in Appendix B),
the person undertaking and sponsoring the project, the potential risks that may have in this
research and potential benefits that may result. In this way, the whole research would undergo a
low risk to announce.
3.4.3 Pilot test
After finished the draft of the questionnaire, there existed 17 questions, and it is necessary to
take a pilot test which means to send the draft questionnaire with the ethics approval to a few
participants and let them fill it and find out whether it is practical or not. Therefore, before
collecting data from those participants, two people were invited to take the pilot test. During this
test, two things are aimed to achieve. One is to check the way of express in the questionnaire,
and to check whether the questionnaire convey clearly or not. Because the questionnaire might
have some ambiguous description of image search, participant might misunderstand some
questions. Another aim of pilot test is to check the structure of the questions, whether it could
use to collect the precise and sufficient data from the participants.
After taking the pilot test, these two people found that the question asked the frequency of using
search engines was unclear and easy to misunderstanding, especially the options of this question.
Although three question were designed to ask frequency of using search engines, those
questions’ options were different. After pilot test, those options were unified as the same pattern
to ask the frequency and it became easy to answer in the final questionnaire. In the draft
questionnaire, it did not contain the question of searching topics and difficulty, but the topic will
affect the difficulty of finding images. Therefore, the topics of last image search and how
difficulty participants felt when operating the search engines were added into the questionnaire
in order to investigate the difficulty of image search (Q14 and 15). In the draft questionnaire, Q17
and Q18 once were open questions, which were designed to ask whether participants feel
successful or not when carried out last image search and participants should also write down
their own reasons in those two questions. In the pilot test, participants mentioned that those
questions would be difficult to answer and people may lose their patient when answering such
22
questions. Therefore, those questions was modified after pilot test, especially listed the possible
reasons for successful or unsuccessful image searches. After pilot test and modification, the
questionnaire contained 19 questions in the end.
3.4.3 Questionnaire distribution
The questionnaire was designed and sent out under the guide of the research in University of
Sheffield and obeyed the ethics approval. The questionnaire contains 3 parts with 19 questions,
and sent out by email through the University of Sheffield mail system and in the section of
“student volunteer”. With this survey email, ethics approval and information consent (Seen in
Appendix C) about this research was attached and sent to participants. Participants can read the
information consent to understand the aims of the research. This research used Google Docs to
create an online survey of questionnaire last for 15 days, since 15/07/2013 to 31/07/13. The
survey email was sent to all the students and staff in the University of Sheffield and finally 84
people in the University of Sheffield participated in this survey.
23
4. Result
4.1 Demographic information Q1-Q4
In this research, the survey was divided into 3 parts, part one is about participants’ demographic
information, part two is about people’s perceptions of searching for images and part three is
about people’s most recent experience of searching for images. In the survey, 84 participants
completed the survey. Among them, 38.1% were male and 61.9% were female (Table 1). The
number of female participants was approximately 1.5 times the size of male participants. 58.3%
of the participants’ age ranged from 18 to 24, no matter the participant was male or female.
Nearly 1 out of 5 male participants’ age ranged from 18 to 24 and 38.1% female participants aged
18 to 24. In different age range, in the range of 25-30, gender rate was 1.5 (female: male),
however, in the range of 31-40, male participants were more than female. In Table 2, 86.9% of
the participants were students, only 13.1% of participants were employed. Among students,
42.9% were Master student, 28.6% were undergraduate student, and 6.0% were doctoral
student. However most employed were undergraduates.
Table 1 Q1-Q2 gender and age scale of people who participated in the survey
Age Total
18-24 25-30 31-40 41-50 Over 50
gender
Male Count 17 7 7 0 1 32
% of Total 20.2% 8.3% 8.3% 0.0% 1.2% 38.1%
Female Count 32 13 5 1 1 52
% of Total 38.1% 15.5% 6.0% 1.2% 1.2% 61.9%
Total Count 49 20 12 1 2 84
% of Total 58.3% 23.8% 14.3% 1.2% 2.4% 100.0%
24
4.2 Perceptions regarding searching for images Q5-Q10
4.2.1 Q5-Q7 frequency of searching for images
In the part two of the questionnaire, participants were asked to describe the frequency of using
image search systems to search for images. Three different frequencies of searching for images
were asked to describe, and 4 scales were used to analysis (Table 3). In the scale, 1-5 represent “1
to 5 times per week”, 5-10 represent “5 to 10 times per week”, 10-20 represent “10 to 20 times
per week” and >20 represent “more than 20 times per week”. Furthermore, Frequency 1 means
on average, how often they used search engines to search for images. In this question, 34.5%
Table 2 Q3-Q4 Education background and occupation of participants
occupation Total
Employed Student
education
Doctorate
Count 0 5 5
% within
education
0.0% 100.0% 100.0%
Further education
Count 0 2 2
% within
education
0.0% 100.0% 100.0%
High school
Count 0 2 2
% within
education
0.0% 100.0% 100.0%
Master
Count 3 36 39
% within
education
7.7% 92.3% 100.0%
PG dip
Count 1 0 1
% within
education
100.0% 0.0% 100.0%
Professional
Count 1 4 5
% within
education
20.0% 80.0% 100.0%
Undergraduate
Count 6 24 30
% within
education
20.0% 80.0% 100.0%
Total
Count 11 73 84
% within
education
13.1% 86.9% 100.0%
25
people believed they searched for images 1 to 5 times per week on average, 27.4% participants
searched for images 5 to 10 times per week, 21.4% participants searched for 10 to20 times and
even 16.7% participants searched more than 20 times per week. Frequency 2 is the frequency of
searching for images in job/study, and 59.5% participants searched for 1 to 5 times per week for
job/academic needs. Frequency 3 is the frequency outside job/study. The participants described
nearly the same frequency in job/study and outside, 48.8% participants searched outside job 1 to
5 times per week, but outside job/study was in higher frequency and scatter wider than in
job/study.
Table 3 Q5-Q7 Three different frequencies of searching for images. Frequency 1 was on average image search
frequency, 2 was search for images in job/study, and 3 was search for images outside job/study. 1-5 means 1-5
times per week, 5-10 means 5-10 times per week, 10-20 means 10–20 times per week, and >20 means more than
20 times per week.
Valid
1 - 5 5 - 10 10 - 20 >20 Total
Frequency 1 29 23 18 14 84
Percent 34.5% 27.4% 21.4% 16.7% 100.0%
Frequency 2 50 24 7 3 84
Percent 59.5% 28.6% 8.3% 3.6% 100.0%
Frequency 3 41 23 11 9 84
Percent 48.8% 27.4% 13.1% 10.7% 100.0%
4.2.2 Q8 the reason why carry out image search
The reason why they carried out image search was asked after the frequency of using search
engine to search for images. Different participants gave the different answers. The survey list 8
reasons for participants to rank (1 is very unimportant, 5 is very important). Most participants
thought that finding out what something looks like was the most important reason among all
options to search for images (Table 4). Then curiosity ranked No.2 among those reasons but less
important than finding out what something looks like. People also search for image in need of
preparing presentation and finding out travel destination. Verifying names and getting the idea of
purchasing were also the important reasons why people carried out an image search. For the
reason finding beauty/fashion items, it varied widely between participants, and the numbers of
the 5 scales chosen by participants were primarily equal. Besides those listed reason, participants
also wrote down other reasons for image search. One participant wrote that searching for images
to enhancing email communication and enriching class hand-outs. Another participant wrote that
finding out original image and photograph was important especially those images were reposted
26
too many times. Two participants wrote that sharing amusing memes with others was the reason
to search for images, which means replying others on the internet with small images and photos
to enhance your ideas. There are still some other reasons for conducting an image search, such
as: finding out porn, program or apple icons, adding to blogs etc. Therefore the further discussion
of this question would be presented in the next section.
Table 4 Q8 Why carry out image search. The table shows the reasons why carried out image search, and
used Likert Scale to rank those reasons. In the scale, 1 to 5 were used, 1 means very unimportant, and 5
means very important. In the table, Range means how many numbers participants used to measure the
reasons.
Range Mean Std. Deviation
Something looks like 3 4.50 .720
Curiosity 4 3.99 .814
Prepare presentation 4 3.93 1.159
Travel destination 4 3.79 1.233
Verify names 4 3.48 1.156
Ideas for purchasing 4 3.35 1.187
Beauty/Fashion items 4 3.10 1.314
Computer wallpaper 4 2.51 1.266
Valid N (listwise)
4.2.3 Q9-Q10 which search engine is popular and why
In part two, participants were also asked to describe which search engine systems they usually
used for image search (Figure 1). Q9 was a multiple choices question, and participants can
choose as many options as they wanted. Furthermore, participants can choose search engines
even they only used once. As the results, the majority of participants used Google as their
favourite search engines, and using search by image (73, 87.0%) more than web search system
(64, 76.2%) to search for images. After that, Bing was ranked No.3, and 10.7% participants chose
it as their usually used image search engines. Then, 9.5% participants used Baidu to search for
images, which is a Chinese search engine and 2.4% participants chose Tineye, which is a reverse
image search engine in professional image search. 2.4% chose Flickr, which is image hosting
website as image search engine, 1.2% participant chose Ask, which is an American search system
and 1.2% participant used Yandex, which is a Russian search engine. Participants were also asked
the reason why they chose those search engines to search for images (Table 5). After Q9, some
reasons were listed for participant to rank (1 is very unimportant, 5 is very important) in Q10. The
highest ranked reason to choose a search engine is that the chosen search engine would return
27
good search results based on participants’ search query. This reason ranked 4.69 was higher than
the second reason which was the familiarity with search engines. Then the user-friendly interface
and returning large numbers of various images were two important factors for choosing search
engines to search for images. Offering relevant text with images, Availability of copyright-
unrestricted images and sharing images with others were all ranked above 3 but less than 4 in the
survey as factor for choosing search engines.
Table 5 Q10 the ranking of reasons for choosing image search. In the table, reasons for choosing search
engines were listed, and used Likert Scale to rank those reasons. In the scale, 1 to 5 were used, 1 means very
unimportant, and 5 means very important. In the table, Range means how many numbers participants used
to measure the reasons.
Range Mean Std. Deviation
Good results 4 4.69 .711
Familiarity 3 4.35 .736
Interface 4 4.43 .749
Image amount 4 4.07 1.039
Relevant text 4 3.81 1.081
Copyright 4 3.37 1.138
Ability to share 4 3.15 1.256
Valid N (listwise)
Figure 1 Q9 Popular searcher engine to search for images (multiple choices can be made)
73 87.0%
28
4.3 Most recent experience of searching for images
4.3.1 Q11 when participant carried out the last image search.
After analysing above questions, the most recent experience of searching for images of
participants were asked. From question 11 to question 19 were all about this experience. The
question 11 was asked when participant last time carried out image search (Table 6). Most
participants (38.1%) thought they carried out the most recent image search 1-3 days ago. Then
about 42.9% participants carried out image search within a day in total. Nearly 10% thought they
carried out an image search more than 4 days ago but within a week. Therefore, primarily 91.7%
participants carried out a search for images within a week, and only 8.3% participants carried out
an image search more than one week ago.
Table 6 Q11 When was the Last time participants searched for images.
Frequency Percent Cumulative Percent
Valid
Within an hour 6 7.1% 7.1%
1-2 hours ago 10 11.9% 19.0%
More than 3 hours ago but
within a day
20 23.8% 42.9%
1-3 days ago 32 38.1% 81.0%
More than 4 days ago but
within a week
9 10.7% 91.7%
More than one week ago 7 8.3% 100.0%
Total 84 100.0%
4.3.2 Q12-Q13 topics and reasons carrying out last image search
In Q12 and 13, the topics and the reasons to carry out last image search were asked (Table 7,
Table 8, and Table 9). Nearly 52.4% people carried out last image search for curiosity or
entertainment. About 19% participants carried out image search for job/academic needs, and
37.5% participants who searched for job/academic needs searched medical related material or
plants/animals related images last time. In the reason of gaining knowledge, most participants
(27.3%) select art related materials as search topics, and in the reason of problem solving, 75.0%
participants searched for travel places/destination. 16.7% of all the participants chose figure of
people as search topics and they always carried out either for curiosity or for entertainment
reason. Different participants had different search topic. In the survey, 15 possible topics were
listed and every topic had its own chooser. There were still 8.3% participants chose others option
in Q12. Therefore, the topics and reasons for carrying out image search varied with different
29
participants.
Table 7 Q12 Topics participants chose to search for images in their most recent image search
Frequency Percent Cumulative Percent
Valid
News 3 3.6% 3.6%
Plants/Animals 6 7.1% 10.7%
Cook/Food related material 7 8.3% 19.0%
Fashion/Beauty items 3 3.6% 22.6%
Arts related materials 6 7.1% 29.8%
Sports related material 3 3.6% 33.3%
Places 12 14.3% 47.6%
Mechanism related material 1 1.2% 48.8%
Electronics products 3 3.6% 52.4%
Information technology related material 3 3.6% 56.0%
Medical related material 8 9.5% 65.5%
Films, Books or Music related material 8 9.5% 75.0%
Figure of People 14 16.7% 91.7%
Others 7 8.3% 100.0%
Total 84 100.0%
Table 8 Q13 Reasons participants chose to search for images in their most recent image search
Frequency Percent Cumulative Percent
Valid
Curiosity 24 28.6% 28.6%
Entertainment 20 23.8% 52.4%
Job/ academic needs 16 19.0% 71.4%
Gain knowledge 11 13.1% 84.5%
Problem solves 8 9.5% 94.0%
others 5 6.0% 100.0%
Total 84 100.0
30
Table 9 Q12-Q13 correlation of topics and reason participant chose to carry out last image search. In the table,
Frequency was used to analysis the correlation of topics and reasons.
Reason Total
Curiosity Entertainment Job/ academic
needs
Gain
knowledge
Problem
solves
others
topics
News 1 0 1 1 0 0 3
Plants/Animals 0 1 3 1 0 1 6
Cook/Food related
material
3 1 0 2 0 1 7
Fashion/Beauty items 1 0 0 1 0 1 3
Arts related materials 0 2 1 3 0 0 6
Sports related material 1 2 0 0 0 0 3
Places 4 2 0 0 6 0 12
Mechanism related
material
0 0 0 1 0 0 1
Electronics products 0 2 0 1 0 0 3
Information technology
related material
0 0 2 0 1 0 3
Medical related material 1 0 6 0 0 1 8
Films, Books or Music
related material
4 3 0 1 0 0 8
Figure of People 8 6 0 0 0 0 14
Others 1 1 3 0 1 1 7
Total 24 20 16 11 8 5 84
4.3.3 Q14-Q16 search engine and difficulty & satisfaction of search engines
When asked which search engine the participant used for last image search, the majority of
participants (96.4%) used Google as the search engine, no matter web search or search by image
(Table 10). Only 1 (N=84) participant used Yahoo!, 1 (N=84) participant used Baidu (Chinese), and
1 (N=84) participant used Bing to carry out most recent image search. 63.1% participants thought
using search engines to search for images was easy to operate (Figure 2), and 90.5% participants
thought they were satisfied with the search results, and felt successful with this search (Figure 3).
The majority people (61.9%) believed that Google web search was easy to operate and satisfied
with its search result. In this survey, only 9.5% participants felt unhappy with this search. Among
those participants, 37.5% participants thought this image search was neither successful nor
31
failure, and only 12.5% participants thought this search was very failure. Among those
participants who felt unsuccessful, 12.5% used Google search by image, 25.0% used Google web
search. Then 4.7% out of all participants believed this image search was failure and they all used
Google search by image and among them 75% people thought search by image was somewhat
difficult to operate and 25% thought it was very difficult. Finally, 1.2% participant used Baidu and
felt very difficult to operate and very unsatisfied with the search engine and cannot find out what
he/she searched.
Figure 2 Q15How difficult to operate search engine to search for images
Figure 3 Q16 Satisfaction of search engine to search for images
2.4% 4.8%
11.9%
17.8% 63.1%
50.0% 40.5%
3.6% 4.7%
1.2%
32
Table 10 Q14-Q16 How participants satisfied with the search engine, how difficulty they felt to operate the
search engines, and which search engine were used to carry out the most recent image search
Satisfaction Difficulty Total
1
(Easy)
2 3 4 5
(Difficult)
Very successful, I
found exactly
what I required
Search
engine
Google (web search) 14 1 0 0 15
Google (search by image) 22 2 2 1 27
Total 36 3 2 1 42
Successful
Search
engine
Google (web search) 4 8 3 15
Google (search by image) 11 3 3 17
Yahoo! 0 1 0 1
Bing 1 0 0 1
Total 16 12 6 34
Neither
successful nor
failure
Search
engine
Google (web search) 0 2 2
Google (search by image) 1 0 1
Total 1 2 3
Failure
Search
engine Google (search by image)
3 1 4
Total 3 1 4
Very failure, I did
not find what I
required
Search
engine Baidu (Chinese)
1 1
Total 1 1
Total
Search
engine
Google (web search) 18 9 5 0 0 32
Google (search by image) 34 5 5 4 1 49
Yahoo! 0 1 0 0 0 1
Baidu (Chinese) 0 0 0 0 1 1
Bing 1 0 0 0 0 1
Total 53 15 10 4 2 84
4.3.3 Q17-Q18 successful and unsuccessful with the last image search
In Q17, after analysing the results of participants’ perception about most recent experience of
search for images, the reason why they felt successful or unsuccessful were asked. In the above
question, there 76 (90.5%) participants felt successful with their last time image search. However,
only 74 (88.1%) finished the reason ranking box about how they felt the search would be success
33
(Table 11). The ranking box listed some reasons about being successful with the search results
and using 1-5 scale to rank. It was the same with the unsuccessful part. In the successful part,
most participant thought that the search engine had user-friendly interface and easy to operate
are the most important reasons to feel satisfy with the search results, and return relevant images
about the search query terms was also an important reason. Then, some participants believed
that image search engine in the native language would be a reason to choose for searching for
images, but some did not. Returned images with high resolution and copyrights seemed less
important for those participants to feel satisfy with the search engines.
In Q18, for the unsuccessful part (Table 12), although only 8 people ranked the reasons, there
were still some interesting findings. The reason that search engine returned irrelevant images as
results was the most important reason for the participants to feel unhappy with the image
search. Then, images with low resolution and the search engine unfamiliar to the searchers were
less important reason for them to feel unhappy. They did not think that search engine difficult to
operate and images with the wrong size were the important reasons to feel unsatisfied. No
matter for the participants satisfied with search engine or not, images with copyrights and
unlimited to use would not be the reason to conduct those feelings.
Table 11 Q17 the possible reasons for participants having a success experience in image search. In
the table, Likert Scale was used to rank those reasons. In the scale, 1 to 5 were used, 1 means very
unimportant, and 5 means very important.
N Range Mean Std. Deviation Variance
Statistic Statistic Statistic Std. Error Statistic Statistic
Easy to operate 74 2 4.86 .044 .382 .146
Quickly 74 1 4.80 .047 .405 .164
Relevant 74 2 4.61 .066 .569 .324
Native language 74 4 4.22 .147 1.264 1.596
High resolution 74 4 3.97 .110 .950 .903
With copyrights 74 4 3.07 .109 .941 .886
Valid N (listwise) 74
34
Table 12 Q18 the possible reasons for participants having a failure experience in image search. In the
table, Likert Scale was used to rank those reasons. In the scale, 1 to 5 were used, 1 means very unimportant,
and 5 means very important.
N Range Mean Std. Deviation Variance
Statistic Statistic Statistic Std. Error Statistic Statistic
Irrelevant 8 2 4.00 .267 .756 .571
Low resolution 8 4 3.25 .526 1.488 2.214
Unfamiliarity 8 3 3.00 .423 1.195 1.429
No copyrights 8 3 2.88 .441 1.246 1.554
Wrong size 8 3 2.75 .453 1.282 1.643
Difficult to use 8 3 2.00 .423 1.195 1.429
Valid N (listwise) 8
4.3.3 Q19 aspects of search engine to improve
After research participants’ perceptions about the most recent search experience, they were
asked to select which aspects should be improved of the search engines and the options of this
question were multiple choose that participants can choose as many as they want. 66.7%
participants thought they wished those search engines could return more relevant images and
fewer irrelevant images, although 90.5% people felt satisfied with the search results of those
search engines. Then, 50% participants thought that search engines should improve the quality
or size of the returned images and provide related text information about those images. 45.2%
participants needed more related information about the images, and 28.6% participants thought
that providing more search options or improving the interface should be important for those
search engines. 23.8% participants thought search engines should improve the way that returned
images displayed, and the same number of people thought search engines should display the
copyrighted images or where those images came from. In the other options, only one person
chose the others option and thought that search engine was fine enough, no more place need to
improve.
35
Table 13 Q19 In which aspects, search engines for image search can be improved (multiple choices can
be made)
Frequency Percent of
participant
Percent of
response
Cumulative
Percent
Valid
Returning more relevant images and
fewer irrelevant images
56 66.7 27.5 27.5
Improving the quality or size of the
returned images
42 50.0 20.6 48.0
Providing related information about the
image
38 45.2 18.6 78.4
Providing more search options or
improving the interface
24 28.6 11.8 59.8
Improving the way that returned images
are displayed
20 23.8 9.8 88.2
Displaying copyrighted images or images
from commercial websites
20 23.8 9.8 98.0
Others 4 4.7 2.0 100.0
Total 204 242.8 100.0
36
5. Discussion
This research indicated University students and staff’s perceptions on using search engines to
search for images in job/study and outside. Most people participants in this survey were female,
because they may be more patient than male. Besides, this survey was carried out through
university mail system, so student participants were more than staff, and most students were
undergraduate students.
Firstly, in this study, different participants had their different frequency of image search relating
to the RQ1.1. Although most people would search for images 5-10 times per week, more than
16% participants thought they would search for image more than 20 times per week.
Furthermore, participants would like to search images 1-5 times per week for job/academic
needs and more or less the same times for personal reasons. However, the overall people chose
more times per week for personal reason search than job/academic needs. Although in their
most recent experience of image search, less than 20% people described that they searched
images for job/academic need, they all ranked job/academic need was an important reason for
them to search for images. In the survey, the most significant reason to carry out image search is
to find out what something looks like, and then is for curiosity. After that is the reason to prepare
presentation and find out the travel destination. In Yoon’s research (2011), finding images as
computer wallpaper was an important reason for image search, however, in this survey, most
people thought it was unimportant and only ranked 2.51 on average (1=very unimportant, 5=very
important). For the reason to find beauty/fashion items, the overall score was 3.10 on average,
but for male, the score was only 2.66 and for female, the score was 3.25. There exists huge
difference in this reason for image search. The study found that female felt finding
beauty/fashion items was more important than male.
Secondly, according to RQ1.2, 2.1 and 2.2, most people choose Google (web search) and Google
(search by image) as their most favourite search engines to search for images. The reason to
choose this search engine is that it would return good image results. After that, familiarity and
user-friendly interface were another two crucial reasons to choose this search engine to search
for images. Therefore, image searchers usually value the search results than other aspects of
search engine. A good search engine can satisfy user with good and relevant search results. This
would be important for search engine development. In the survey, participants still chose
returning more relevant images and fewer irrelevant images as an aspect that search engines
need to improve. Therefore, no matter how interface improved and large amount of images
returned, the most important respect need to improve is relevant image results. As most people
37
choose Google as search engine and chose the reason why used it is that it can return good
search results. Therefore, Google may return more relevant images than other search engines so
that people were more likely to use it. In Fergus’s research, users can use Google’s image search
engine by querying key world, and Google would match key words with textual annotation to
return the results. While for Yahoo!, the same way was applied to its image search function, and
this made Yahoo! become the strongest competitors with Google in the world (Zhang et al, 2006).
However, in the survey, it was not hard to tell that only 6.0% participants choose Yahoo! as search
engine. Therefore, in some extent, Yahoo! cannot be the strongest competitor with Google.
Furthermore, Google no longer only used textual annotation for image search. It also applied
CBIR in “search by image” section for image search. In the survey, around 90% participants used
“search by image” function. Although they may misunderstand the meaning of “search by image”
and obscure this function with “image search”, they would still be a lot of people used “search by
image” function before. So in the survey, the most popular search engine to search for images
was Google.
Thirdly, CIT was applied to analyse the most recent experience of searching for images in order to
answer the RQ 3.1, 3.2. In Sudman & Bradburn’s study, time and other factors would affect
memory on response in surveys (1973). However, if the stimulate of memory were too strong,
and recalled by people several times, the memory would not fade but strengthen. In the survey,
participants were asked to answer when their most recent image search carried out, and more
than 80% participants carried out image search within 3 days. Therefore, the CIT of their most
recent image search would be at some extent reliable and accurate. In this study, most people
felt Google was easy to operate when searching for images, and Bing and Yahoo! were not hard
to operate. People who used Baidu (Chinese) found it was not easy to operate. Different people
carried out last image search for different reasons. Most people searched for curiosity or
entertainment reasons, and for this reason most participants chose figure of people as search
topic. Job/academic need and Gain knowledge were another two important reasons to carry out
last image search, and for those reasons, most people chose medical related images. In problem
solves reason, most people chose places as topics. They might believe that Google map was
another kind of image or they just simply searched the images showed the view of the place.
Fourthly, in the survey, most people felt successful with this image search. Only 8 people felt
unsuccessful with this image search (Table 10). In CIT, “incident” has its significant value to study,
so those 8 “incident” case would be analysis in details. In those “incident” cases, 2 people were
male and 6 were female, and the ratio of female to male was higher than overall ratio. Female
would feel more unsatisfied with the search results than male. After that, among those 8
38
participants, 4 people felt failure and they all used “search by image” function. Among those 4
people, 3 participants thought “search by image” function was not easy to operate and 1
participant thought it was difficult to search. “Search by image” function is a CBIR search option,
and users should upload images as query terms to find similar images as search results. The
search process might be complex for some users, so they may felt difficult to operate. However,
because they may misunderstand the meaning of “search by image”, it could be that people felt
hard to operate image search function. Nevertheless, most people used Google (both web search
and search by image) for last image search and felt easy to operate the search engine, and the
image search interface was almost exactly the same with web search. Therefore, if those people
misunderstanding “search by image”, they may not feel hard to operate. Google search by image
would hard to operate and users did not satisfy with the search results. Besides, Tineye is also a
CBIR search engine, and users need to add images’ URL or uploaded image as search query.
However, no one used it as search engine for last image search. Therefore, CBIR might not be
widely used by people or not as popular as TBIR search engine. Furthermore, only one people
used Baidu (Chinese) to search for images in those 8 cases, felt difficult to operate the search
engine and did not satisfy with the search results. The reason for that was return irrelevant
images.
Finally, according to RQ3.3, no matter people feel satisfied with the search results or not, they all
wished search engines can return more relevant images and less irrelevant images. The reasons
for participants feel satisfy with the search engine is because it returned relevant images quickly
and in native language, and the reasons for not satisfaction is because it returned irrelevant
images and with low resolution. Therefore, people thought search engines need to improve its
search results. Furthermore, some people thought images should be up to date and with nice
quality or changeable size. Providing related information was also another aspect to improve.
However, people did not care whether the image displayed copyrights or not, not matter
participant satisfied with the search results or not.
39
6. Conclusion
6.1 Summary
With the increase growth of image retrieval, it becomes significant to analysing people’s
perception on image retrieval. The frequency, perceptions and preferences of people searching
for images are very critical to develop search engines. In this research, 84 participants completed
the questionnaire, and all of them are students or staff in the University of Sheffield. The survey
was designed to find out people’s behaviours on the image search and which search engine was
the popular engine for image search according to research questions. In the questionnaire, part
one was designed to collect participants’ demographics. Most participants were undergraduate
and master students, and most of them were female.
In the research, part 2 was designed to collect basic information about participant using search
engines to search for images relating to RQ1.1 and RQ 1.2. In this part, the most popular search
engine was Google, which was used by 87.0% participants in the survey. On average, people
would use search engines to search for images 5-10 times per week. Participants used search
engines nearly the same times to search for images for job/academic needs and personal needs.
They usually searched for both needs 1-5 times per week. Most people use search engine to
search for images to prepare presentation, find out what something looks like, find travel
destination, verifying names etc. They carry out images search for curiosity, entertainment,
gaining knowledge, solving problems and job/academic need.
Finally, in part three, the questions were designed to analyse participants’ most recent
experience of image search and to answer the RQ 2.1, 2.2, 3.1, 3.2, 3.3. 90.5% participants
thought their last image search was successful and the search engine was easy to operate. Only
9.5% participants thought it was unsuccessful. Among them, most people used Google search by
image function and thought it was hard to operate. Participants would search some medical
related materials for job/academic needs, and search people’s figure for curiosity or
entertainment. The reason which people thought this image search was successful would be that
search engines returned relevant images quickly and was easy to operate. And the reason for
unsuccessful image search was that search engines returned irrelevant images or images with
low resolution. In the end, participants thought search engine need to return more relevant
images than irrelevant images. Search engines also need to improve the quality or size of the
returned images, provide related information about the image, and provide more search options
or improve the interface.
40
6.2 Limitation
There are some limitations of this survey. First, the participants in this survey are only from the
University of Sheffield, the age, education background and other personal information may
restrict in a limited range. Furthermore, locational limitation is also a problem, as people around
world have different habit in searching for images, therefore the survey may not represent all the
phenomena in the image search. Although conducting pilot tests, there are still some problems
of misunderstanding or ambiguous terms. Because only two people took the pilot test, there may
exist some issued not be found by those two people. Finally, the survey is only displayed online,
so that it lacks the face to face communication. It is not so precise when studying participants’
behaviours through the survey, especially for some experience questions.
6.3 Future study
More researches should be taken in the CBIR search engine to find out why people thought it was
hard to operate and how to improve the search engines with these reasons. As this research
indicated, although most people satisfied with the search results, people still believed that search
engine should return more relevant results. Therefore, the algorithm of returning relevant image
as search results need to improve, and would be better to combine CBIR and text annotation
together to search for images. People prefer to use search engines that they are familiar with or
the image search interface is user-friendly. Hence, improve the search engine interface and make
it convenient to operate are very important in the future work.
(Word count: 11,772)
41
Reference
Acharya, S., & Devi, M. R. V. (2011). Image retrieval based on visual attention model. Procedia
Engineering, 30(2011), 542–545.
Ben-Haim,N. Babenko B.and Belongie,S.(2006). ImprovingWeb-based Image Search via Content
Based Clustering. IEEE.
Burger,T.(2010). A Model of Relevance for Reuse-Driven Media Retrieval. Proceedings of the 2nd
Workshop on Semantic Multimedia Database Technologies, co-located with the 5th
International Conference on Semantic and Digital Media Technologies (SAMT).
Campbell, D. T., & Fiske, D. W. (1959). Convergent and discriminant validation by the
multitrait-multimethod matrix. Psychological bulletin, 56(2), 81.
Campbell, D. T., Stanley, J. C., & Gage, N. L. (1963). Experimental and quasi-experimental
designs for research (pp. 171-246). Boston: Houghton Mifflin.
Creswell. J.W (2009). Research design: Qualitative, quantitative, and mixed methods
approaches. Sage.
Chen, H.-L., & Rasmussen, E.M. (1999). Intellectual access to images. Library Trends, 48(2), 291–
302.
Choi, Y., & Rasmussen, E. M. (2003). Searching for images: The analysis of users' queries for image
retrieval in American history. Journal of the American Society for Information Science and
Technology, 54, 498–511.
Covey, D.T. (2002). Usage and usability assessment: Library practices and concerns.
Washington, DC: Council on Library and Information Resources, Digital Library Federation.
Eakins, J. P., Briggs, P., & Burford, B. (2004). Image retrieval interfaces: A user perspective. In P.
Enser, Y. Kompatsiaris, N. E. O'Connor, A. F. Smeaton, & A. W. M. Smeulders (Eds.), Lecture notes
in computer science. Image and video retrieval, 3115. (pp. 628–637) Berlin, Germany: Springer-
Verlag.
Fergus, R., Fei-Fei, L., Perona, P., & Zisserman, a. (2005). Learning object categories from Google’s
image search. Tenth IEEE International Conference on Computer Vision (ICCV’05) Volume 1,
1816–1823 Vol. 2.
42
Flanagan, J. C. (1954). The critical incident technique. Psychological bulletin,51(4), 327.
Fukumoto, T. (2006). An analysis of image retrieval behavior for metadata type image database.
Information Processing and Management, 42, 723–728.
Goodrum, A., Bejune, M. and Siochi, A. C. (2003). A state transition analysis of image search
patterns on the web. Image and Video Retrieval, 281-290
Hodge, D. R. & Gillespie, D. F. (2005). Phrase Completion Scales. Encyclopedia of Social Measurement, Vol3, 53–62. San Diego: Academic Press
Jegou, H., Douze, M. and Schmid, C.(2008) Hamming embedding and weak geometric consistency
for large scale image search. In Proceedings of the European Conference on Computer Vision
(ECCV), vol1, 304-317.
Jorgensen, C., Jaimes, A., Benitez, A. B., & Chang, S. F. (2001). A conceptual framework and
empirical research for classifying visual descriptors. Journal of the American Society for
Information Science and Technology, 52(11), 938-947
Jorgensen, C. and Jorgensen, P. (2005). Image querying by image professionals. Journal of the
American Society for Information Science and Technology, 56/12, 1346-1359.
Kennedy, L. Slaney,M. and Weinberger, K.(2009) Reliable tags using image similarity: mining
specicity and expertise from large-scale multimedia databases. In Proceedings of the 1st
workshop on Web-scale multimedia corpus (WSMC), pages 17-24, Beijing, China.
Neal, D. (2009), Introduction. Bulletin of the American for Society Information Science and
Technology, 35: 6–12.
Neuman, S. B., & McCormick, S. (1995). Single-Subject Experimental Research: Applications
for Literacy. Order Department, International Reading Association, 800 Barksdale Road, PO
Box 8139, Newark, DE 19714-8139 (Book No. 128: $11 members, $16 nonmembers)..
Pu, H. -T. (2008). An analysis of failed queries for web image retrieval. Journal of Information
Science, 34, 275–289.
Rieh, S. Y., and Hilligoss, B. (2008). College students' credibility judgments in the information-
seeking process. In M. J. Metzger, & A. J. Flanagin (Eds.), Digital media, youth, and credibility (pp.
49– 72). Cambridge, MA: MIT Press
43
Rodden, K. Basalaj, W. Sinclair, D. and Wood, K. (2001) organisation by similarity assist image
browsing? In Proceedings of the ACM SIGCHI conference,Seattle, WA, United States.
Rui, Y., Huang, T. S. and Chang, S. (1999). Image Retrieval: Current Techniques, Promising
Directions, and Open Issues. Journal of Visual Communication and Image Representation,
10/1, 39-62.
Sudman, S., & Bradburn, N. M. (1973). Effects of time and memory factors on response in
surveys. Journal of the American Statistical Association, 68(344), 805-815.
Sclaroff, S., Taycher, L., & Cascia, M. La. (1997). ImageRover : A Content-Based Image Browser for
the World Wide Web 1 Introduction 2 Approach, 2–9.
Shatford-Layne, S. (1994). Some issues in the indexing of images. Journal of the American Society
for Information Science, 45(8), 583–588.
Silverman, D.(2011).Ethics and qualitative research. Qualitive research, 3rd edition, III,416-438.SAGE publication,Ltd.
Smeulder,A.W., Worring, M., Santini, S., Gupta, A., and Jain, R.(2000). Content-Based Image
Retrieval at the End of the Early Year. IEEE Trans.
Tirilly, P., Mu, X., Huang, C., Xie, I., Jeong, W., & Zhang, J.(2012). On the consistency and features
of image similarity. Proceedings of the 4th Information Interaction in Context Symposium on –
IIIX 12, 164.
Villa, R., Halvey, M., Joho, H., Hannah, D., & Jose, J. M. (2010). Can an intermediary collection
help users search image databases without annotations? Proceedings of the 10th annual joint
conference on Digital libraries
Westman, S., Lustila, A., & Oittinen, P. (2008). Search strategies in multimodal image retrieval. In
M. Lalmas, A. Tombros, P. Borlund, J. W. Schneider, D. Kelly, J. Feather, & A. P. de Vries (Eds.),
Proceedings of the 2nd International Conference on Information Interaction in Context (pp. 13–
20). New York: NY: Association for Computing Machinery.
Yoon, J. (2011). Searching images in daily life. Library & Information Science Research, 33(4), 269–
275.
Zhang, L., Chen, L., Jing, F., Deng, K. and Ma, W. (2006). EnjoyPhoto—A Vertical Image Search
Engine for Enjoying High-Quality Photos. Proceedings of the 14th annual ACM international
44
conference on Multimedia, 367-376.
45
Appendix A Questionnaire
Frequency of using search engines to
search for images
*Required
Information Consent
The University of Sheffield. Information School Researchers: Junyu Li MSc
Information System Management [email protected] Doctor Robert Villa
Lecturer of Information School [email protected] We wish to better
understand people’s experience of carrying out image searches, including people’s
preferences, perceptions and frequency of use of image search engines. This
study aims to investigate how people use image search engine, people’s behaviour
using image retrieval systems and why people succeed or fail searching for images.
This study would be important for building image search systems and will help to
improve the image search systems in the future. We are inviting adults who used
search engines to search images (pictures, photos, maps etc) before. If you are
not, please ignore this survey. The risks of participating are no greater than those
experienced in everyday life. We are collecting only the results of the
questionnaires and no personal identification will be collected. No identifying
information will be collected. All responses will be anonymous. The data will be
stored at the University of Sheffield on a secure server. Your participation will be
kept strictly confidential and the information you provide will only be used for
my dissertation which will be publicly available. Please contact the School in six
months. If you require any addition information, please contact one of the
researchers listed above. I understand that my participation is voluntary and that
I am free to withdraw at any time without any negative consequences. I
understand that I may decline to answer any particular question or questions, by
closing the browse window. I understand that my responses will be kept strictly
confidential, that my name or identity will not be linked to any research materials,
and that I will not be identified or identifiable in any report or reports that result
from the research. I give permission for the research team members to have
access to my anonymised responses. I give permission for the research team re-
use my data for further research as specified above. I agree to take part in the
research project as described above. [Note: If you have any difficulties with, or
wish to voice concern about, any aspect of your participation in this study, please
contact Dr. Angela Lin, Research Ethics Coordinator, Information School, The
46
University of Sheffield ([email protected]), or to the University
Registrar and Secretary.]
By pressing continue *
I confirm I have read the above information sheet, and I agree to take part in the
research project as described above
o I Confirm
Part One
Basic Information about you
1. Please select your gender? *
o Male
o Female
2. Please select your age group *
o 18-24
o 25-30
o 31-40
o 41-50
o Over 50
3. Are you currently or primarily (more than 50% of time) a/an ______? *
o Employed
o Student
o Unemployed
o Other:
4. Please select your educational background *
o High school / Secondary School
o Further education/ College diploma
o Undergraduate
o Master
o Doctorate
o Professional (law,medicine...)
o Other:
47
Part Two
Basic Questions about image retrieves
5. On average, how often do you use search engines to search for images? *
(Images can include pictures, photos, maps, etc.)
o 1 to 5 times per week
o 5 to 10 times per week
o 10 to 20 times per week
o More than 20 times per week
6. How many times per week do you use search engine to search for images in your
job or study? *
o 1 to 5 times per week
o 5 to 10 times per week
o 10 to 20 times per week
o More than 20 times per week
7. How many times per week do you use search engine to search for images
OUTSIDE of your job or study? *
o 1 to 5 times per week
o 5 to 10 times per week
o 10 to 20 times per week
o More than 20 times per week
8. How important is image search to you for the following tasks? *
Very
important Important
Neither important
nor unimportant
Unimportant Very
unimportant
Prepare presentation materials
Use as computer wallpaper
Get ideas for purchasing
Travel destinations
Beauty/Fashion
48
Very
important Important
Neither important
nor unimportant
Unimportant Very
unimportant
items
See what something looks like
Verify names
Out of Curiosity
Are there any other tasks for which you conduct image searches?
9. Which search engines do you normally use to search for images? *
(Please click as many responses as apply)
o Google (web search)
o Google (search by image)
o Yahoo!
o Tineye
o Baidu (Chinese)
o Bing
o Yandex (Russian)
o Ask
o AOL
o Other:
10. Which of the following are most important to you when conducing image
searchers? *
Strongly
Agree Somewhat
Agree
Neither Agree Nor Disagree
Somwhat Disagree
Strongly Disagree
Good search
49
Strongly
Agree Somewhat
Agree
Neither Agree Nor Disagree
Somwhat Disagree
Strongly Disagree
results
Familiarity with the search engine
User-friendly interface
The number of search results and their variety
Availability of relevant text information
Availability of copyright-unrestricted images
Ability to share images with other people
Part Three
Questions about image retrieves experience
11. When was your most recent search for an image *
o Within an hour
o 1-2 hours ago
o More than 3 hours ago but within a day
o 1-3 days ago
o More than 4 days ago but within a week
50
o More than one week ago
12. Why did you carry out this search *
o Curiosity
o Entertainment
o Job/ academic needs
o Gain knowledge
o Problem solves (e.g search for a travel destination, map, etc.)
o Other:
13. What was the topics on which you searched? *
o News related material
o Plants/Animals
o Cook/Food related material
o Fashion/Beauty items
o Arts related material
o Sports related material
o Places (e.g travel destination, map, etc )
o Mechanism related material (e.g motocycle, car, etc)
o Electronics products (e.g laptop, phones, etc)
o Information thechnology related material
o Medical related material
o Films, Books or Music related material
o Figure of People (e.g celebrity, etc)
o Other:
14. What was the search engine you used for this search *
o Google (web search)
o Google (search by image)
o Yahoo!
o Tineye
o Baidu (Chinese)
o Bing
o Yandex (Russian)
o Ask
o AOL
51
o Other:
15. How difficult was this search, to find relevant images? *
1 2 3 4 5
Easy Select a value from a range from 1,Easy, to 5,Di ffi cult,. Difficult
16. How happy were you with this search results? *
o Very successful, I found exactly what I required (Jump to Q18)
o Successful (Jump to Q18)
o Neither successful nor failure
o Failure
o Very failure, I did not find what I required
17. Please rate the following statements, if you felt you were not successful in your
image search (Jump to Q19)
Strongly
Agree Somewhat
Agree
Neither Agree Nor Disagree
Somwhat Disagree
Strongly Disagree
Images were not relevant
Images with low resolution
Images with wrong size
Images were unfamiliar that can't be told whether it was required
No clear copyrights of images
Image search options or interface
52
Strongly
Agree Somewhat
Agree
Neither Agree Nor Disagree
Somwhat Disagree
Strongly Disagree
were difficult to use
Please enter any other reasons you felt led to an unsuccessful image search
18. Please rate the following statements, if you felt you were successful in your image
search
Strongly
Agree Somewhat
Agree
Neither Agree Nor Disagree
Somwhat Disagree
Strongly Disagree
Images were highly relevant
Results of search came out quickly
Search engine was easy to operate
The search engines represented in your native languange
Images with clear copyrights
Images with high resolution
Please enter any other reasons you felt led to a successful image search
53
19. Please select ways the image search engine you used can be improved *
(Please click as many responses as apply)
o Returning more relevant images and fewer irrevelant images
o Improving the quality or size of the returned images
o Providing more search options or improving the interface
o Providing related information about the image
o Improving the way that returned images are displayed
o Displaying copyrighted images or images from commercial websites
o Other:
Appendix B Ethics Proposal Students Staff This proposal submitted by: This proposal is for:
Undergraduate Specific research project
X Postgraduate (Taught) – PGT Generic research project
Postgraduate (Research) – PGR This project is funded by:
N/A
Project Title: People’s preferences, perceptions and frequency of using search engines to search for images
Start Date: July 15, 2013 End Date: August 30, 2013
Principal Investigator (PI):
(student for supervised UG/PGT/PGR research)
Junyu Li
Email: [email protected]
Supervisor:
(if PI is a student)
Doctor Robert Villa
Email: [email protected]
Indicate if the research: (put an X in front of all that apply)
54
Involves adults with mental incapacity or mental illness, or those unable to make a personal decision
Involves prisoners or others in custodial care (e.g. young offenders)
Involves children or young people aged under 18 years of age
Involves highly sensitive topics such as ‘race’ or ethnicity; political opinion; religious, spiritual or other beliefs; physical or mental health conditions; sexuality; abuse (child, adult); nudity and the body; criminal activities; political asylum; conflict situations; and personal violence.
Please indicate by inserting an “X” in the left hand box that you are conversant with the University’s policy
on the handling of human participants and their data.
X
We confirm that we have read the current version of the University of Sheffield Ethics Policy
Governing Research Involving Human Participants, Personal Data and Human Tissue, as shown on
the University’s research ethics website at: www.sheffield.ac.uk/ris/other/gov-ethics/ethicspolicy
55
Part B. Summary of the Research
B1. Briefly summarise the project’s aims and objectives: (This must be in language comprehensible to a layperson and should take no more than one-half page. Provide enough information so that the reviewer can understand the intent of the research) Summary:
Image search is used in many different contexts for many different purposes: for example, some
image search behaviour may originate from a user’s daily life, from a job task, or from academic life.
In this study, I will investigate people’s preferences, perceptions and frequency of using search
engines to search for images, which can help to understand people’s image search requirements, and
how they search images through search engines. This study will investigate people’s most recent
experience of image search. The objectives are to investigate how people use image search engines,
people’s behaviours using image retrieval systems, and why people succeed or fail when conducting
image searches. This study will be important for building image search systems and will help to
improve the image search systems in the future.
B2. Methodology: Provide a broad overview of the methodology in no more than one-half page.
Overview of Methods:
This study will conduct a web-based survey created on Google Docs, and all undergraduate,
postgraduate, doctoral students and staff in the University of Sheffield would be invited to complete
the survey.
If more than one method, e.g., survey, interview, etc. is used, please respond to the questions in Section C for each method. That is, if you are using both a survey and interviews, duplicate the page and answer the questions for each method; you need not duplicate the information, and may simply indicate, “see previous section.”
56
C1. Briefly describe how each method will be applied Method (e.g., survey, interview, observation, experiment): The methods used will be web-based survey only. Description – how will you apply the method? The survey will be created and mounted on Google Docs and available from July 15-30, 2013. The survey will conclude three parts: Part 1 consists of four questions about education, gender, age, etc. to investigate participants’ basic information and draw a demographic profile of them. Part 2 will contain 6 questions about people’s preferences and frequency of using image search engines in both their daily and academic life. Part 3 will contain 8 questions about the participants’ most recent experience of using image search engines including their perceptions and reasons. Most of those questions are closed questions.
About your Participants C2. Who will be potential participants? Potential participants will be undergraduate, postgraduate, doctoral students and staff enrolled in the University of Sheffield C3. How will the potential participants be identified and recruited? The potential participants will be sent an email via the VOLUNTEERS discussion list by Doctor Villa C4. What is the potential for physical and/or psychological harm / distress to participants? The potential for harm or distress is no larger than what might be experienced in everyday life C5. Will informed consent be obtained from the participants?
X Yes
No
If Yes, please explain how informed consent will be obtained? The Information Sheet/Consent Form will be the first page of the survey and will describe what will be asked of participants. An email will be supplied for further information if a participant has questions that are not answered on the Information Sheet. If No, please explain why you need to do this, and how the participants will be de-briefed? C6. Will financial / in kind payments (other than reasonable expenses and compensation for time) be offered to participants? (Indicate how much and on what basis this has been decided) No financial/in kind payments will be provided
About the Data C7. What data will be collected? (Tick all that apply)
Print Digital
Participant observation
Audio recording
Video recording
Computer logs
Questionnaires/Surveys X
Other:
Other:
57
C8. What measures will be put in place to ensure confidentiality of personal data, where appropriate? Each survey response will be associated with an arbitrary number, and all responses will be anonymous. C9. How/Where will the data be stored? The data will be exported from Google Docs and safely stored on my laptop and the University of Sheffield secure server which can only accessed by myself and Doctor Villa. C10. Will the data be stored for future re-use? If so, please explain The data will be only used for my postgraduate project, and other academic research purposes.
About the Procedure C11. Does your research raise any issues of personal safety for you or other researchers involved in the project (especially if taking place outside working hours or off University premises)? If so, please explain how it will be managed. There are no issues of personal safety involved with this survey and there will be no contact between the investigators and the participants. The survey will be anonymous.
58
The University of Sheffield. Information School
Research Ethics Review Declaration
Title of Research Project: People’s behaviours through image retrieve
We confirm our responsibility to deliver the research project in accordance with the University of
Sheffield’s policies and procedures, which include the University’s ‘Financial Regulations’, ‘Good
Research Practice Standards’ and the ‘Ethics Policy Governing Research Involving Human
Participants, Personal Data and Human Tissue’ (Ethics Policy) and, where externally funded, with the
terms and conditions of the research funder.
In submitting this research ethics application form I am also confirming that:
The form is accurate to the best of our knowledge and belief.
The project will abide by the University’s Ethics Policy.
There is no potential material interest that may, or may appear to, impair the independence
and objectivity of researchers conducting this project.
Subject to the research being approved, we undertake to adhere to the project protocol
without unagreed deviation and to comply with any conditions set out in the letter from the
University ethics reviewers notifying me of this.
We undertake to inform the ethics reviewers of significant changes to the protocol (by
contacting our academic department’s Ethics Coordinator in the first instance).
we are aware of our responsibility to be up to date and comply with the requirements of
the law and relevant guidelines relating to security and confidentiality of personal data,
including the need to register when necessary with the appropriate Data Protection Officer
(within the University the Data Protection Officer is based in CiCS).
We understand that the project, including research records and data, may be subject to
inspection for audit purposes, if required in future.
We understand that personal data about us as researchers in this form will be held by those
involved in the ethics review procedure (e.g. the Ethics Administrator and/or ethics
reviewers) and that this will be managed according to Data Protection Act principles.
If this is an application for a ‘generic’ project all the individual projects that fit under the
generic project are compatible with this application.
We understand that this project cannot be submitted for ethics approval in more than
one department, and that if I wish to appeal against the decision made, this must be done
through the original department.
Name of the Student (if applicable):
Junyu Li
Name of Principal Investigator (or the Supervisor):
Doctor Robert Villa
Date: July7, 2013
59
Appendix C Ethics Information Consent The University of Sheffield. Information School
People’s preferences, perceptions and frequency of using search engines to search for images
Researchers
Junyu Li Master of Informatics Student Information School University of Sheffield [email protected]
Doctor Robert Villa Lecturer of Information School Information School University of Sheffield [email protected]
Purpose of the research
We wish to better understand people’s experience of carrying out image searches, including people’s preferences, perceptions and frequency of use of image search engines. This study aims to investigate how people use image search engine, people’s behaviour using image retrieval systems and why people succeed or fail searching for images. This study would be important for building image search systems and will help to improve the image search systems in the future.
Who will be participating?
We are inviting undergraduate, postgraduate, doctoral students and staff enrolled in the University of Sheffield to participate in a web-based survey.
What will you be asked to do?
We will ask you to respond to a series of questions in three parts: Part 1 consists of four questions about education, gender, age, etc. to investigate participants’ basic information and draw a demographic profile. Part 2 will contain 6 questions about people’s preferences and frequency of using image search engines in both their daily and academic or work life. Part 3 will contain 9 questions about participants’ most recent experience of using image search engines including their perceptions and reasons. The survey will take about 10 minutes to complete.
What are the potential risks of participating?
The risks of participating are the same as those experienced in everyday life
What data will we collect?
We are only collecting your responses to the survey questions. No other data will be recorded.
What will we do with the data?
We will be analysing the data for inclusion in my master’s dissertation and other research purposes.
Will my participation be confidential?
The data is anonymous and all responses will be associated with a number. No personal identifying information will be recorded or retained.
What will happen to the results of the research project?
The results of this study will be included in my master’s dissertation which will be publicly available. Please contact the School in six months.
60
I confirm that I have read and understand the description of the research project, and that I have had an opportunity to ask questions about the project. I understand that my participation is voluntary and that I am free to withdraw at any time without any negative consequences. I understand that I may decline to answer any particular question or questions, or to do any of the activities. If I stop participating at all time, all of my data will be purged. I understand that my responses will be kept strictly confidential, that my name or identity will not be linked to any research materials, and that I will not be identified or identifiable in any report or reports that result from the research. I give permission for the research team members to have access to my anonymised responses. I give permission for the research team to re-use my data for future research as specified above. I agree to take part in the research project as described above.
Participant Name (Please print) Participant Signature
Researcher Name (Please print) Researcher Signature Date
Note: If you have any difficulties with, or wish to voice concern about, any aspect of your participation in this study, please contact Dr. Angela Lin, Research Ethics Coordinator, Information School, The University of Sheffield ([email protected]), or to the University Registrar and Secretary.
61
Appendix D Approval Letter
Information School Research Ethics Panel Letter of Approval Date: 12th July 2013 TO: Junyu Li The Information School Research Ethics Panel has examined the following application: Title: People’s preferences, perceptions and frequency of using search engines to search for images
Submitted by: Junyu Li And found the proposed research involving human participants to be in accordance with the University of Sheffield’s policies and procedures, which include the University’s ‘Financial Regulations’, ‘Good Research Practice Standards’ and the ‘Ethics Policy Governing Research Involving Human Participants, Personal Data and Human Tissue’ (Ethics Policy). This letter is the official record of ethics approval by the School, and should accompany any formal requests for evidence of research ethics approval. Effective Date: 12th July 2013
Dr Angela Lin Research Ethics Coordinator
62
Access to Dissertation A Dissertation submitted to the University may be held by the Department (or School) within which the Dissertation was undertaken and made available for borrowing or consultation in accordance with University Regulations. Requests for the loan of dissertations may be received from libraries in the UK and overseas. The Department may also receive requests from other organisations, as well as individuals. The conservation of the original dissertation is better assured if the Department and/or Library can fulfill such requests by sending a copy. The Department may also make your dissertation available via its web pages. In certain cases where confidentiality of information is concerned, if either the author or the supervisor so requests, the Department will withhold the dissertation from loan or consultation for the period specified below. Where no such restriction is in force, the Department may also deposit the Dissertation in the University of Sheffield Library.
To be completed by the Author – Select (a) or (b) by placing a tick in the appropriate box If you are willing to give permission for the Information School to make your dissertation available in these ways, please complete the following:
√ (a) Subject to the General Regulation on Intellectual Property, I, the author, agree to this dissertation being made immediately available through the Department and/or University Library for consultation, and for the Department and/or Library to reproduce this dissertation in whole or part in order to supply single copies for the purpose of research or private study
(b) Subject to the General Regulation on Intellectual Property, I, the author, request that this dissertation be withheld from loan, consultation or reproduction for a period of [ ] years from the date of its submission. Subsequent to this period, I agree to this dissertation being made available through the Department and/or University Library for consultation, and for the Department and/or Library to reproduce this dissertation in whole or part in order to supply single copies for the purpose of research or private study
Name Junyu Li
Department Information School
Signed Date 25/08/2013
To be completed by the Supervisor – Select (a) or (b) by placing a tick in the appropriate box
(a) I, the supervisor, agree to this dissertation being made immediately available through the Department and/or University Library for loan or consultation, subject to any special restrictions (*) agreed with external organisations as part of a collaborative project.
*Special restrictions
(b) I, the supervisor, request that this dissertation be withheld from loan, consultation or reproduction for a period of [ ] years from the date of its submission. Subsequent to this period, I, agree to this dissertation being made available through the Department and/or University Library for loan or consultation, subject to any special restrictions (*) agreed with external organisations as part of a collaborative project
Name
Department
Signed Date
THIS SHEET MUST BE SUBMITTED WITH DISSERTATIONS IN ACCORDANCE WITH DEPARTMENTAL REQUIREMENTS.