DEVELOPING PRICING MODELS FOR ONLINE ART SALES USING

  • Slides: 26
Download presentation
DEVELOPING PRICING MODELS FOR ONLINE ART SALES USING TEXT ANALYTICS LAUREL POWELL, ZBIGNIEW W.

DEVELOPING PRICING MODELS FOR ONLINE ART SALES USING TEXT ANALYTICS LAUREL POWELL, ZBIGNIEW W. RAS COMPUTER SCIENCE DEPT. , COLLEGE OF COMPUTING & INFORMATICS (CCI) SA UNIVERSITY OF NORTH CAROLINA, CHARLOTTE, USA & ANNA GELICH CS SCHOOL OF ARCHITECTURE, COLLEGE OF ARTS + ARCHITECTURE UNIVERSITY OF NORTH CAROLINA, CHARLOTTE, USA Research Supported by

OUTLINE • Introduction • Current Art Market • Pricing in the Art Market (galleries,

OUTLINE • Introduction • Current Art Market • Pricing in the Art Market (galleries, online sales, street market, …) • Features having impact on Price (size, name of artist, medium…. ) • Knowledge Based Recommender System for Assigning Price Tags to Art • Current results

INFORMATION ABOUT PAINTING/ARTIST – AN EXAMPLE • Anastasia Kachina • Size: 11. 81 x

INFORMATION ABOUT PAINTING/ARTIST – AN EXAMPLE • Anastasia Kachina • Size: 11. 81 x 15. 75 in • $405 -> sold for $350 by Saatchi Art about 1 year • 431 artworks ago • Education: MA • Kharkiv Art College (1998 -2003) • Kharkiv State Academy of Design and Arts (20032010) • Exhibitions: 10 • Locations and Type of Exhibitions The same type of her painting costs now about $700 (one year ago she had much less artworks)

INFORMATION ABOUT PAINTING/ARTIST – ANOTHER EXAMPLE • Annabel Andrews • $5, 200 – asking

INFORMATION ABOUT PAINTING/ARTIST – ANOTHER EXAMPLE • Annabel Andrews • $5, 200 – asking price by Saatchi Art • Born in 1940, lives in Spain • Artworks - 154 • Many individual and collective exhibitions • Received many prizes Asking price - $1, 300

CONTEMPORARY ART MARKET • Multi-billion dollar industry • 63. 7 Billion USD in 2017

CONTEMPORARY ART MARKET • Multi-billion dollar industry • 63. 7 Billion USD in 2017 • More galleries closed than opened (what business model to follow to be successful? ) NPS based or GET-KEEP-GROW? • Highly uncertain, and has proven to be a challenge to explain using traditional economic theories. What about data analytics? • 29% of sales take place online (Saatchi. Art, Artfinder, …. ) • Price transparency key concern of online buyers

ART AND RECOMMENDER SYSTEMS • Existing recommender systems in the art market domain do

ART AND RECOMMENDER SYSTEMS • Existing recommender systems in the art market domain do not use data analytics but human experts to evaluate fine art pieces and make recommendations. • Mutual. Art (https: //www. mutualart. com/artappraisal) has the world’s most comprehensive database of past sale results but the number of features describing these sales is quite small. Its advisors are assigning price tags to new pieces of art by comparing them with similar pieces in the Mutual. Art database. • The charge for a single service (one piece of art) is $49 and the waiting time to get a recommendation is 72 hours

ART AND RECOMMENDER SYSTEMS • FINDARTINFO (http: //www. findartinfo. com/english. html) is a similar

ART AND RECOMMENDER SYSTEMS • FINDARTINFO (http: //www. findartinfo. com/english. html) is a similar but free art appraisal service which contains information about 438, 003 artists and 3, 775, 762 art prices. With this art appraisal tool, an artist can value his/her fine art by comparing it with recent auction prices of similar pieces. There are websites providing free art appraisal hints. • For instance, wiki. How (https: //www. wikihow. com/Value-Your-Art) helps to value artworks.

RECOMMENDER SYSTEM - ARTIST • It is based on data analytics. Knowing the artist’s

RECOMMENDER SYSTEM - ARTIST • It is based on data analytics. Knowing the artist’s name, appraisal of the piece of art is done by Art. IST by its personalized discovery module built from the data describing similar artists and similar art pieces including their sales. • Dataset • Base Features (medium, size of painting, subject, style, artist name, features describing artist reputation, …) • New Features Developed and Tested • Word Count (in artist & artworks provided information) • Social Media (presence or absence of social media links in artist provided information) • Clustering & sentiment analysis of artworks provided information

 • Scraped from Artfinder. com • About 200, 000 artworks (currently more than

• Scraped from Artfinder. com • About 200, 000 artworks (currently more than ½ million) • 2, 800 artists (currently more than 5, 000 artists) • 60 countries DATASET • Artist Pages (information about artists which has optional sections: CV, education, awards received, links to other locations presenting the artist, …) • Artwork Pages • Reviews We collaborate with QCollector which has data about 5 million paintings being sold by them.

PRICE DISTRIBUTIO N Maximum 1, 000 USD Minimum 12. 97 USD Figure omits works

PRICE DISTRIBUTIO N Maximum 1, 000 USD Minimum 12. 97 USD Figure omits works priced at greater than 5000 USD due to scale

BASE FEATURES • – artist. ID - A unique identifier for an artist •

BASE FEATURES • – artist. ID - A unique identifier for an artist • – artist. Country - The artist’s current country of residence as listed on their profile • – percent n stars (n=1, 2, 3, 4, 5) - Percentage of n star reviews out of the total number of reviews. • – medium - Artist provided medium of the artwork. • – style - Artist provided style of the artwork. • – subject - Artist provided subject of the artwork. • – authentication - Artist provided method of authenticating the work. • – artwork width - The width of the artwork. • – artwork height - The height of the artwork.

DISCRETIZATION • Price dimension was reduced to discrete intervals. • These intervals are as

DISCRETIZATION • Price dimension was reduced to discrete intervals. • These intervals are as follows: (0 - 105), [105 - 205), [205 - 405), [405 - 605), [605 - 810), [810 - 1030), [1030 1445), [1445 - 1825), [1825 - 2455), [2455 - 3855), [3855 5000), [5000 - 10, 000), (> 10, 000). The intervals were selected by searching for areas where very few works were priced and splitting the price feature in these gaps.

Confusion Matrix with Base Features for Artfinder Dataset

Confusion Matrix with Base Features for Artfinder Dataset

RESULTS WITH BASE SET OF FEATURES Method F 1 Score Precision Recall k. NN

RESULTS WITH BASE SET OF FEATURES Method F 1 Score Precision Recall k. NN 0. 621 0. 619 0. 625 SVM 0. 17 0. 221 0. 169 Random Forest 0. 663 0. 662 0. 667

RESULTS WITH NEW FEATURES: BIOGRAPHY, DESCRIPTION OF ARTWORKS, TITLE

RESULTS WITH NEW FEATURES: BIOGRAPHY, DESCRIPTION OF ARTWORKS, TITLE

RESULTS WITH SOCIAL MEDIA PRESENCE

RESULTS WITH SOCIAL MEDIA PRESENCE

TEXT CLUSTERING Awards Biography Descriptio n Education Events Title

TEXT CLUSTERING Awards Biography Descriptio n Education Events Title

TEXT CLUSTERING • Paragraph Vector • Also known as Doc 2 Vec • Extension

TEXT CLUSTERING • Paragraph Vector • Also known as Doc 2 Vec • Extension of Word 2 Vec • 10, 25 and 50 Clusters • Biography, Awards, Events and Education most impactful We are currently replacing Paragraph Vector by Folksonomy

RESULTS WITH 10 CLUSTERS

RESULTS WITH 10 CLUSTERS

RESULTS WITH 25 CLUSTERS

RESULTS WITH 25 CLUSTERS

RESULTS WITH 50 CLUSTERS

RESULTS WITH 50 CLUSTERS

ALL FEATURES (25 CLUSTERS)

ALL FEATURES (25 CLUSTERS)

 • VADER SENTIMENT ANALYSIS • Valence Aware Dictionary for s. Entiment Reasoning •

• VADER SENTIMENT ANALYSIS • Valence Aware Dictionary for s. Entiment Reasoning • Rules and Lexicon • Biography, Description of Artworks, Title

COMBINED FEATURES F 1 Precision Recall Base Features 0. 664 0. 663 0. 667

COMBINED FEATURES F 1 Precision Recall Base Features 0. 664 0. 663 0. 667 BF, Social, Word Counts 0. 677 0. 676 0. 681 BF, Social, Word Counts, Clusters (10) 0. 674 0. 678 BF, Social, Word Counts, Clusters (25) 0. 676 0. 675 0. 68 BF, Social, Word Counts, Clusters (50) 0. 679 0. 678 0. 683 BF, Social, Word Counts, Sentiment 0. 685 0. 684 0. 689 BF, Social, Word Counts, Sentiment, Clusters (50) 0. 685 0. 69

With Base Features Confusion Matrix with Base Features and All Added Features

With Base Features Confusion Matrix with Base Features and All Added Features