Some ideas around PIADA Picture Indexing Affect Description

  • Slides: 17
Download presentation
Some ideas around PIADA: (Picture Indexing: Affect, Description and Availability) Diana Santos Information and

Some ideas around PIADA: (Picture Indexing: Affect, Description and Availability) Diana Santos Information and Communication Technologies 1

PIADA in a nutshell n Context: making sense of images n High-level concerns: n

PIADA in a nutshell n Context: making sense of images n High-level concerns: n Purpose n Interaction with text/context n Cultural factors n Jokes, irony, metaphor, affect, humour n Practical setup: Web archive, image repositories, Wikimedia n Expected outcomes n Picture ontologies (as well as integration with other NLP ontologies) n Reasoning with images n Cross-cultural understanding of image differences Information and Communication Technologies 2

Presentation outlook n A short description of the main ideas of PIADA n 5

Presentation outlook n A short description of the main ideas of PIADA n 5 things not yet solved n How to go about it? Data; 3 kinds of reasoning n SINTEF’s LEG (Language Engineering Group) n PIADA in a Norwegian context Information and Communication Technologies 3

1. Find images with feelings n Most image descriptions are about “objective” features like

1. Find images with feelings n Most image descriptions are about “objective” features like who (names), where (places), dates, colours and objects n Asterix sad – no way, and we know there exist these pictures n Try Hillary happy and Hillary angry and you get mostly the same pictures: those of Hillary Clinton together with texts that may report whatever about happiness or sadness of her competitors or fans Information and Communication Technologies 4

2. Find culturally implicit information n Japanese people in the underground (tube): most probably

2. Find culturally implicit information n Japanese people in the underground (tube): most probably posted by Japanese people, not with captions saying that people are Japanese. . . n Try Google images and give up n Man reading. This can be a good enough caption in an European museological context, but certainly not in an Asian, or African context Information and Communication Technologies 5

3. Find synergy between images and text n why this caption or this illustration?

3. Find synergy between images and text n why this caption or this illustration? why do they work together? creative use of pictures in text: or: how many of these images of Sócrates are jokes? n from the obvious: a happy baby in a diaper’s advertisement to the provocative or very subtle humour (was it? ) Information and Communication Technologies 6

4. Find the picture’s purpose http: //staff. science. uva. nl/~marx/ n Why is this

4. Find the picture’s purpose http: //staff. science. uva. nl/~marx/ n Why is this picture here? n Reasoning about a multimedia world. . . finding the cause for humour and the important details How my students end up. . n Vi roser Anne! Information and Communication Technologies 7

5. Help discuss or describe pictures for different purposes Properties of pictures are often

5. Help discuss or describe pictures for different purposes Properties of pictures are often mentioned in some contexts n (didactical, antropological, documentary, scientific) to focus on particular details: see the character behind Jesus, see the tree on the left, note the tool on his hand, look at the back wings, at the dark clouds, at the tumor. . . n (police or law courts) to explain why the picture was taken: revolver under the table, near the corpse n (artistic setting): in sunlight, in rainy wather, with a special lins. . . Information and Communication Technologies 8

In a nutshell, we believe that n there are many aspects of picture description

In a nutshell, we believe that n there are many aspects of picture description and reasoning that have not received any or enough attention, namely n emotions n creativity and humour n crosscultural differences n intertextuality with pictures n a natural language processing angle is the right way to attack them Information and Communication Technologies 9

Real applications n Significantly enhancing image bank providers activity n Helping professionals that need

Real applications n Significantly enhancing image bank providers activity n Helping professionals that need images and text n teachers n museum staff n encyclopedia authors (the Wikipedia community) n other multimedia content providers: for games, educational CDs, textbooks n advertisers n historians and biographers n Common multilingual image search Information and Communication Technologies 10

How to go about it? Data n Image collections to study n Image ontologies

How to go about it? Data n Image collections to study n Image ontologies and folksonomies available n “Text and image” collections n Wikipedia n Web pages (Internet archive) n Special sites and or multimedia products (guided tours) n Image search logs n Elicitation collections: sets of stories about image search n Game results: eliciting similarities or associations among images Information and Communication Technologies 11

How to go about it? Reasoning 1 n If you want to find a

How to go about it? Reasoning 1 n If you want to find a picture of a strong healthy man, or of a genius, you probably find an instance of a person that illustrates these qualities (such as Johny Weissmuller or Einstein and look for them) n If you want to find a picture of Asterix angry, you can look for more “objective” descriptions such as Asterix shouting or Asterix beating n If you want to illustrate the property black you probably look for concepts where black is stereotypical, such as coffee. . . Information and Communication Technologies 12

How to go about it? Reasoning 2 n You need to know the context

How to go about it? Reasoning 2 n You need to know the context of the annotation or text to know what should be implicit and what should be probably commented out n searches in. pt for Sócrates are probably about the prime-minister but in. no, after the philosopher, comes the Brazilian football player n the stereotypical image for man or woman is obviously different depending on the gender of the beholder, no matter sexual orientation, and the same for places and cultures n a “typical” restaurant (and its food) varies widely n pictures captioned Lisbon (or Dublin, or Helsinki) are most often by tourists. People who live in Lisbon give the precise names of what they take the picture of, or don’t even bother to specify location Information and Communication Technologies 13

How to go about it? Reasoning 3 n n n Why are the images

How to go about it? Reasoning 3 n n n Why are the images chosen? What is the kind of connection? What is their import? Which other associations -- interpictuality -- they bring? Why can they be considered offensive or funny? Do they feel old-fashioned? Do they feel modern? Information and Communication Technologies 14

Language engineering at SINTEF n n n Question answering Ontologies Geographical reasoning Contrastive studies

Language engineering at SINTEF n n n Question answering Ontologies Geographical reasoning Contrastive studies Information extraction Corpus search We believe all these pieces will help us to address the image search and indexing issue. Information and Communication Technologies 15

PIADA in a Norwegian context We want to develop specific knowledge on images in

PIADA in a Norwegian context We want to develop specific knowledge on images in Norwegian n The vocabulary of images and image search in Norway n Demo collections n Study of user behaviour: what do people ask for, what do they want to see? n Picture reasoning ontologies n We hope that by cooperating with commercial actors we will do something useful not only for research purposes Information and Communication Technologies 16

Specific proposal n brukerstyrte innovasjonsprosjekter (BIP) OR n kompetanseprosjekter med brukermedvirkning (KMB) n Scanpix

Specific proposal n brukerstyrte innovasjonsprosjekter (BIP) OR n kompetanseprosjekter med brukermedvirkning (KMB) n Scanpix Norge as prime contractor n SINTEF writes most of the proposal n ABM-utvikling is also involved n Other Norwegian actors also contacted Information and Communication Technologies 17