[ieee 2014 systems and information engineering design symposium (sieds) - charlottesville, va, usa...
TRANSCRIPT
Using Supervised Learning to Classify Clothing
Brand Styles
C. David Kreyenhagen, Timur I. Aleshin, Joseph E. Bouchard, Adam M. I. Wise, and Rachel K. Zalegowski University of Virginia, cdk3jf, tia4kj, jeb5ne, amw6mc, [email protected]
Abstract - Machine learning techniques have the
potential to alter the highly competitive online fashion
retail industry by improving customer service through
personalized recommendations. A fashion style
classification system can improve the customer search
functionality and provide a more personalized
experience for the user. Supervised learning techniques
with fashion based applications face the problem of
developing quantitative measures for describing fashion
products which are subjective in nature. To address this issue the authors asked fashion experts to assist in the
assembly of a training set of brand-style associations.
Quantitative measures were attributed to each brand in
the training set by applying natural language processing,
text mining, and eBay query results. This data set was
used to train a support vector machine which classified
the approximately 8000 remaining brands into style
categories. The prospective classifier model was assessed
based on its positive predictive values which yielded a
56.25% success rate. Given that there are eight different
styles to choose from, a baseline for the percentage is
only 12.5%. The SVM thus adds significant value to the
classification of fashion brands. The final style
categorization was integrated as a new filter feature that
allows the user to narrow down their searches and
access relevant results.
Index Terms – Fashion, Machine learning, Supervised
learning, Support vector machines, Text mining
INTRODUCTION
Since a shopper’s purchasing decisions are often influenced
by the opinions of others [1], many customers seek the
assistance of a personal shopper in order to stay up to date
with today’s fashion styles and trends. Although these personal shoppers provide a helpful, unbiased opinion,
many customers are too busy, or cannot afford, to utilize
personal shoppers that are offered at popular department
stores.
Best Fashion Friend (BFF), a startup social fashion
application based out of New York City, NY, seeks to
provide a shopping experience comparable to that of a
personal shopper by incorporating social media aspects and
personalized shopping into a web application. BFF allows
users to do many things including but not limited to:
shopping for specific items of clothing, designing looks that
involve multiple pieces of clothing, and asking for a friend’s
approval of clothing items and/or looks. By creating a social
network centered on fashion, the company hopes to drive
sales in the online fashion industry.
The goal of the authors, as consultants for BFF, is to
improve the search feature within the app. A fashion style
classification can improve the customer search functionality
and provide a more personalized experience for the user.
Categorizing search results by fashion style and
implementing a filtering system would allow users to
manually refine their search results. This manual filtering gives users greater control in finding merchandise that
interests them. Connecting users to more clothing they like
serves BFF’s higher-level objective of increasing the
number of purchases from their affiliate network of vendors.
BFF earns a commission on every affiliate network
purchase that occurs after a user finds clothing through the
app. Filtering also makes the app more useful and will allow
BFF to improve customer loyalty and attract new customers.
A more complete visual of these goals is shown in Figure 1.
FIGURE 1
BFF COMPANY OBJECTIVES
PROBLEM DEFINITION
I. Motivation for Personalization
Liang, Chen, and Turban found that the perceived
usefulness of a service is higher for personalized services
than for non-personalized ones [2]. Since the usefulness of a
service influences the amount of user traffic seen by that
978-1-4799-4836-9/14/$31.00 (c) 2014, IEEE 239
service, BFF can improve customer loyalty, increase the
user base, and attract new customers by providing the
opportunity for user personalization. It has been shown that
“non-monetary benefits such as convenience from online
personalization can also serve as incentives for consumers
to part with their personal and preference information” [3].
The BFF application already incorporates some social media
functions, such as allowing users to become “friends” with
one another to view each other’s fashion preferences. However, more personalization is desired, specifically in the
BFF Store feature. Therefore, the authors' goal is to improve
the personalization of the Best Fashion Friend search
functionality.
II. Recommendation System Approach to Personalization
Besides the use of social media functions, another way personalization may be incorporated into the BFF
application is through a personalized recommendation
system. A recommendation system is used to apply
knowledge and techniques to the problem of making
personalized recommendations during an interaction with a
user [4]. The most logical place to implement the concept of
a recommendation system is the BFF Store, where the
system would provide search results tailored to an
individual’s specific interests.
Most recommendation systems require at least some
information about the users, whether it be demographic data
or preferences. This is referred to as the "cold start"
problem. After delving deeper into BFF’s user data, the
authors determined that there was not enough useful data to
overcome this problem and create a recommendation system
of any significance.
III. Style-brand Classification Using Support Vector Machines
Introducing a filtering system circumvents the cold start
problem while improving the personalization of the BFF
Store search. In addition to basic filtering, the BFF user will
also be able to refine his or her search results based on
fashion style preferences. In order to create a style-based filter, the items in the BFF Store must first be classified. By
applying concepts of support vector machines (SVM) to the
item descriptions, the authors were able to create a method
to classify each brand according to its dominant style.
According to Martin-Barraga, Lillo, and Romo in the
European Journal of Operational Research, SVMs are
powerful tools that provide nonparametric classifications on
high-dimensional data [5]. According to a study performed
using SVMs for Science Direct Journal, due to their
simplicity and their embodiment of structural risk
minimization, SVMs are recommended as the standard
intelligence technique for classification [6]. It is their
simplicity and ease of use on high-dimensional data that
made SVMs the obvious tool to apply when creating style-
brand classifications to improve the user’s personalized
experience on BFF.
SYSTEM DESIGN
A style filter was created which allows users to refine their
search results based on fashion style. The filter relies on
eight lists of fashion brands. These lists are titled Active,
Bohemian, Classic, Edgy, Couture, Minimalist, Preppy, and
Outdoor. Each list includes the brands which are most
representative of these respective styles. The means by
which they were created and implemented are described
below.
I. System Architecture
The BFF Store function is a means by which users can
discover new clothing items, add items to their BFF closet,
and make clothing purchases. The user can find new items
by first inputting a search query. Members of the BFF
affiliate network are then searched for items matching the
query. Items are returned with information regarding brand,
price, color, and category. If the user purchases an item
from an affiliate, BFF is awarded a portion of the sale.
The new interface for the BFF Store allows users to filter their searches. Once they have searched for an item,
the results can be refined based on item attributes. Users
can also filter the results based on their style preferences
(Figure 2).
FIGURE 2
BFF STORE INTERFACE WITH STYLE FILTER
Since style is not an attribute passed down from the
affiliate network, style filtering must be done after the
results are on BFF servers. A supervised learning algorithm
was used to classify the approximately 8,000 fashion brands
in the affiliate network by rating them in each style
category. When a style is selected, products made by
brands that are the most affiliated with the selected style are shown to the user first. The filtering process is shown in
Figure 3.
II. Methodology
Each article of the clothing available from the affiliate
network is defined as a JSON object embedded with certain
characteristics, namely brand and a short text description of the item. Assuming that a clothing item’s style is correlated
to the item’s brand, styles can be assigned to a large set of
240
FIGURE 3
FILTER SYSTEM ARCHITECTURE
clothes by classifying the smaller subset of brands. To
classify each brand as a member of one of the eight
predefined styles, each brand was treated as a point in a 16
dimensional space and evaluated by the support vector
machine. Eight of these dimensions measure the relevance
of a style using the tf-idf method of text mining [7].
Term frequency-inverse document frequency (tf-idf) is a statistical method to determine how important an
individual word or phrase is to discerning the uniqueness of
a block of text. For each brand, a block of text
characterizing that brand was created by extracting the
adjectives, adverbs, and nouns from the name and
description of every item in the affiliate network that
belongs to that brand. These text blocks were fed into the tf-
idf algorithm which determines the frequency of each word
in the brand’s text block relative to the frequency of that
word in the global corpus of all text blocks. This is
calculated into a tf-idf statistic, which can be used to weigh
the relative importance of that word in assigning a style to a
brand. This method is commonly used in search engines to
determine how relevant a webpage is based on its text
content given the search query.
The two metrics involved in calculating the tf-idf
statistic are term frequency and inverse document frequency. Term frequency is the frequency of a word in a
document, or the local importance to the individual text
block. The inverse document frequency measures the global
relevance of a word compared to the complete text corpus,
which consists of the text blocks of all the evaluated brands.
The tf-idf statistic is the product of the term frequency and
the inverse document frequency.
Term Frequency: occurrence of word in document
Inverse Document Frequency:
Where N is the total number of brand descriptions in the
corpus.
The tf-idf algorithm produces a value of 0 to 1 for each
brand-style association, 1 meaning the brand is likely to be
associated with the given style. Since there are eight styles,
the output creates a point for each brand in an eight
dimensional space. Another set of eight dimensions are
added by determining the brand-style associations resulting
from eBay searches.
The eBay brand-style associations are calculated by
taking the fraction of two search queries. The returned
search result count from the first query, which contains the
brand and the style, serves as the numerator. The denominator is purely the search result count of the query
containing the brand only. When calculated for all styles the
resulting set of eight fractions complete the 16 dimensional
point for a specific brand. The data structure of these points
is shown in Figure 4. Each brand’s point is then used for
the SVM.
FIGURE 4
BRAND DATA STRUCTURE
The SVM uses a training set of 65 brands which have
been expertly classified and associated with one of the eight
different styles. A test set of 32 brands is then used to
evaluate the SVM classification of each brand. The metric
for successful classification of the brands is the percentage
of total brands that are classified correctly by the SVM,
which is referred to as the percentage of positively predicted
values.
III. Support Vector Machine Definition
The support vector machine used for style classification is a
supervised learning algorithm. It takes as input a training
set of 97 brand of the form
where is each brand, as represented by a point in the
space . is a 16-dimmensional space with dimensions
corresponding to the eBay and tf-idf data. is the
predetermined expert classification of each brand from a set
of styles,
The SVM solution is a set of 7 hyperplanes which divide the space as follows
(1)
Where is a unit vector , such that . is
the margin to the nearest point, as shown in Figure 5 (Hastie
p. 418).
Affiliate Network
Filter
Price
Color
Brand
Category
BFF Servers
Filter
Style User
241
FIGURE 5
MARGIN CALCULATION USED IN DETERMINING HYPERPLANE FIT
A radial basis function was chosen for the calculation of
each hyperplane in (1), and is implemented by
determines how influential any single point is.
can be determined by solving for any
. Parameter determines the smoothness of
the boundary. If is low, then will be low, and the
boundary will be smooth.
The classification rule for any point is
(2)
The optimization function for the SVM is formalized as
(3)
where . The optimization allows for soft margins,
which allows misclassified points to be separated from their
intended subspace. is an expression which penalizes
misclassification. The optimization attempts to draw hyperplanes with margins as wide as possible, while
minimizing the number of misclassifications. A simplified
visualization of this algorithm is shown in Figure 6 (Hastie
p. 21). A visualization using actual data for Outdoor and
Preppy eBay search data is shown in Figure 7.
FIGURE 6
HYPERPLANE EXAMPLE USING RADIAL BASIS KERNEL FUNCTION AND
SOFT MARGINS
FIGURE 7
HYPERPLANE EXAMPLE USING ACTUAL DATA FOR OUTDOOR AND PREPPY
STYLE DIMENSIONS
RESULTS AND DISCUSSION
Our results yielded 56.25% positively predicted values.
Given that there are eight different styles to choose from a
baseline for the percentage is only 12.5%. The SVM thus
adds significant value in association with natural language
processing to the classification of fashion brands.
Though it is a step in the right direction, 56.25% is not high enough to confidently deploy this methodology
directly as the only source of logic for a classification
system. There are certain factors of the experiment that
could be improved upon to yield better results. Improving
the quality of the training set would be the most influential.
The authors have limited knowledge of the existing fashion
brands and despite the assistance of fashion experts, the
brands style associations and the chosen brands could be
further refined. The size of the training set could also be
increased.
I. Future Work
The primary objective of the project was to personalize the
BFF user experience by allowing users to select styles that
appeal to them. In pursuit of this objective the team was
242
able to classify clothing items into a set of styles by using
machine learning, natural language processing, and text
mining techniques. A recommendation system for BFF
would be the next step of personalization.
Most recommendation algorithms require a moderate
amount of information about users. In order to combat the
cold start problem, some fashion applications require the
user to pick from certain brands and answer questionnaires
to gauge their initial preferences. This technique is similar to what companies like Pandora and Netflix do for
entertainment recommendation.
One methodology for gathering user preferences uses
conditional decision trees, by selecting one option as your
primary preference the next group of options presented is
based on the original decision. This process repeats with the
objective to not necessarily find the best alternative, but to
best categorize the user for future recommendation
purposes. Each conditional probability of selecting a
particular option uses Bayesian forecasting to formulate the
weights of the probabilities of belonging to a user category.
Decision trees are not the only solution to for building out
this type of system, but just one proposed alternative. The
authors suggest further research into algorithms that utilize
machine learning and leverage the style-brand classification
knowledge already gathered.
CONCLUSION
Best Fashion Friend seeks to provide a shopping experience
comparable to that of a personal shopper by incorporating
social media aspects and personalized shopping into a web
application. One key challenge presented by this goal is the
difficulty of showing users clothes that match their style
preferences. To this end, the approximately 8,000 clothing
brands in the site’s affiliate network were classified into
eight unique styles. This allows users to look for clothes that specifically adhere to their personal taste.
A Support Vector Machine was created to address the
classification task. It took search result measures from eBay
and item description text as input for each brand. Given a
training set of 97 brands, it can now classify new brands by
style. Results from the algorithm evaluation were
promising. They yielded 56.25% positively predicted
values, well above the baseline of 12.5%. This indicates
that supervised learning is a valid approach to style
classification. Higher success rates could likely be realized
by acquiring more data and combining multiple machine
learning algorithms. These results imply that the application
of machine learning has significant value in the highly
subjective topic area of women’s fashion.
ACKNOWLEDGMENT
The authors would like to thank Professor Alfredo Garcia of
the Department of Systems and Information Engineering at
the University of Virginia. They would like to thank all of
the employees of Best Fashion Friends. They would like to
thank the survey respondents. The authors take full
responsibility for the ideas, concepts, and opinions
presented here.
REFERENCES
[1] P. Alreck and R. Settle, "Gender effects on Internet, catalogue and
store shopping," The Journal of Database Marketing, vol. 9, no. 2,
pp. 150-162, 2002.
[2] T. Liang, H. Chen and E. Turban, "Effect of personalization on the
perceived usefulness of online customer services: a dual-core
theory," Journal of Electronic Commerce Research, vol. 13, no. 4,
pp. 275-288, 2010.
[3] R. Chellappa and R. Settle, "Personalization versus Privacy: An
Empirical Examination of the Online Consumer’s Dilemma.,"
Information Technology and Management, vol. 6, no. 2-3, p. 198,
2005.
[4] B. Sarwar, G. Karypis, J. Konstan and J. Riedl, "Item-Based
Collaborative Filtering Recommendation Algorithms," GroupLens
Research Group, pp. 285-295, 2001.
[5] B. Martin-Barragan, R. Lillo and J. Romo, "Interpretable support
vector machines for functional data," European Journal of
Operational Research, vol. 232, pp. 146-155, 2014.
[6] A. Al-Anazi and I. Gates, "Support Vector Regression for Porosity
Prediction in a Heterogeneous Reservoir: A Comparative Study,"
Computers & Geosciences, vol. 36, no. 12, pp. 1494-1503, 2010.
[7] G. Salton and M. J. McGill, Introduction to modern information
retrieval, 1983.
[8] R. Rehurek, "Models.tfidfmodel-TF-IDF model," 2014 March 2014.
[Online]. Available:
http://radimrehurek.com/gensim/models/tfidfmodel.html .
[9] M. Pazzani and D. Billsus, "Content-Based Recommendation
Systems," pp. 325-341, 2007.
[10] D. Lemire and A. Maclachlan, "Slope One Predictors for Online
Rating-Based Collaborative Filtering," Proceedings of the 2005
SIAM International Conference on Data Mining, 2005.
[11] T. Kuflik, Z. Boger and P. Shoval, Information Processing &
Management, vol. 42, pp. 469-483, 2005.
[12] A. Kim and E. Ko, "Impacts of Luxury Fashion Brand’s Social
Media Marketing on Customer Relationship and Purchase Intention,"
Journal of Global Fashion Marketing, vol. 1, no. 3, pp. 1480-1486,
2010.
[13] T. Hastie, R. Tibshirani and J. Friedman, The Elements of Statistical
Learning, vol. 2, no. 1, 2009.
[14] R. Burke, "Hybrid Recommender Systems: Survey and
Experiments," User Modeling and User-Adapted Interaction, vol. 12,
pp. 331-370, 2002.
[15] P. e. al, "Scikit-learn: Machine Learning in Python," Scikit-learn,
2011. [Online]. Available:
http://jmlr.org/papers/v12/pedregosa11a.html.
243