IEEE TRANSACTION ON KNOWLEDGE AND DATA ENGINEERING
1
Learning Image-Text Associations
Tao Jiang and Ah-Hwee Tan, Senior Member, IEEE
Abstract—Web information fusion can be defined as the problem of collating and tracking information related to specific topics on the World Wide Web. Whereas most existing work on web information fusion has focused on text-based multi-document summarization, this paper concerns the topic of image and text association, a cornerstone of cross-media web information fusion. Specifically, we present two learning methods for discovering the underlying associations between images and texts based on small training data sets. The first method based on vague transformation measures the information similarity between the visual features and the textual features through a set of predefined domain-specific information categories. Another method uses a neural network to learn direct mapping between the visual and textual features by automatically and incrementally summarizing the associated features into a set of information templates. Despite their distinct approaches, our experimental results on a terrorist domain document set show that both methods are capable of learning associations between images and texts from a small training data set. Index Terms—Data Mining, Multimedia Data Mining, Image-Text Association Mining.
!
1
I NTRODUCTION
The diverse and distributed nature of the information published on the World Wide Web has made it difficult to collate and track information related to specific topics. Although web search engines have reduced information overloading to a certain extent, the information in the retrieved documents still contains a lot of redundancy. Techniques are needed in web information fusion, involving filtering of irrelevant and redundant information, collating of information according to themes, and generation of coherent presentation. As a commonly used technique for information fusion, document summarization... [continues]
1
Learning Image-Text Associations
Tao Jiang and Ah-Hwee Tan, Senior Member, IEEE
Abstract—Web information fusion can be defined as the problem of collating and tracking information related to specific topics on the World Wide Web. Whereas most existing work on web information fusion has focused on text-based multi-document summarization, this paper concerns the topic of image and text association, a cornerstone of cross-media web information fusion. Specifically, we present two learning methods for discovering the underlying associations between images and texts based on small training data sets. The first method based on vague transformation measures the information similarity between the visual features and the textual features through a set of predefined domain-specific information categories. Another method uses a neural network to learn direct mapping between the visual and textual features by automatically and incrementally summarizing the associated features into a set of information templates. Despite their distinct approaches, our experimental results on a terrorist domain document set show that both methods are capable of learning associations between images and texts from a small training data set. Index Terms—Data Mining, Multimedia Data Mining, Image-Text Association Mining.
!
1
I NTRODUCTION
The diverse and distributed nature of the information published on the World Wide Web has made it difficult to collate and track information related to specific topics. Although web search engines have reduced information overloading to a certain extent, the information in the retrieved documents still contains a lot of redundancy. Techniques are needed in web information fusion, involving filtering of irrelevant and redundant information, collating of information according to themes, and generation of coherent presentation. As a commonly used technique for information fusion, document summarization... [continues]
Cite This Essay
- APA
-
(2011, 03). Learning Image Text Associations. StudyMode.com. Retrieved 03, 2011, from http://www.studymode.com/essays/Learning-Image-Text-Associations-609369.html
- MLA
-
"Learning Image Text Associations" StudyMode.com. 03 2011. 03 2011 <http://www.studymode.com/essays/Learning-Image-Text-Associations-609369.html>.
- CHICAGO
-
"Learning Image Text Associations." StudyMode.com. 03, 2011. Accessed 03, 2011. http://www.studymode.com/essays/Learning-Image-Text-Associations-609369.html.