Preview

Information Retrieval Systems

Best Essays
Open Document
Open Document
2993 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Information Retrieval Systems
Research Proposal

Multimodal Information Spaces for Content-based Image Retrieval
Abstract Currently, image retrieval by content is a research problem of great interest in academia and the industry, due to the large collections of images available in different contexts. One of the main challenges to develop effective image retrieval systems is the automatic identification of semantic image contents. This research proposal aims to design a model for image retrieval able to take advantage of different data sources, i.e. using multimodal information, to improve the response of an image retrieval system. In particular two data modalities associated to contents and context of images are considered in this proposal: visual features and unstructured text annotations. The proposed framework is based on kernel methods that provide two main important advantages over the traditional multimodal approaches: first, the structure of each modality is preserved in a high dimensional feature space, and second, they provide natural ways to fuse feature spaces in a unique information space. This document presents the research agenda to build a Multimodal Information Space for searching images by content.

Presented by
Juan Carlos Caicedo Rueda

Research Advisor
Prof. Fabio A. Gonz´lez O. Ph.D a

Area
Computer Science

Research Fields
Information Retrieval and Machine Learning.

1

1 INTRODUCTION

2

1

INTRODUCTION

Content-Based Image Retrieval (CBIR) is an active research discipline focused on computational strategies to search for relevant images based on visual content analysis. In this proposal, multimodal analysis is considered to develop CBIR systems, specially for image collections in which there is some text associated to images. Multimodality in Information Retrieval is sometimes referred to the interaction mechanisms and devices used to query the system. However, since the Multimedia Information Retrieval perspective, multimodality is referred to



References: [1] M. S. Lew, N. Sebe, C. Djeraba, and R. Jain, “Content-based multimedia information retrieval: State of the art and challenges,” ACM Trans. Multimedia Comput. Commun. Appl., vol. 2, no. 1, pp. 1–19, February 2006. [2] R. Datta, D. Joshi, J. Li, and J. Z. Wang, “Image retrieval: Ideas, influences, and trends of the new age,” ACM Comput. Surv., vol. 40, no. 2, pp. 1–60, April 2008. [3] J. Shawe-Taylor and N. Cristianini, Kernel methods for pattern analysis. Press, 2004. Cambridge University [4] J. C. Caicedo, A. Cruz, and F. Gonzalez, “Histopathology image classification using bag of features and kernel functions,” Artificial Intelligence in Medicine Conference, AIME 2009, vol. LNAI 5651, pp. 126–135, 2009. [5] J. C. Caicedo, F. A. Gonzalez, and E. Romero, “Content-based medical image retrieval using a kernel-based semantic annotation framework.” Technical Report UN-BI-2009-01 - National University of Colombia. Submitted to the Artificial Intelligence in Medicine Journal, Tech. Rep., 2009. [6] C. D. Manning, P. Raghavan, and H. Sch¨tze, Introduction to Information Retrieval. Cambridge u University Press, 2008. [7] T. G¨rtner, J. W. Lloyd, and P. A. Flach, “Kernels and distances for structured data,” Machine a Learning, vol. 57, no. 3, pp. 205–232, December 2004. [8] N. Cristianini, J. Shawe-Taylor, and H. Lodhi, “Latent semantic kernels,” Journal of Intelligent Information Systems, vol. 18, no. 2, pp. 127–152, March 2002. [9] Z. Cao, T. Qin, T. Y. Liu, M. F. Tsai, and H. Li, “Learning to rank: from pairwise approach to listwise approach,” in ICML ’07: Proceedings of the 24th international conference on Machine learning. New York, NY, USA: ACM, 2007, pp. 129–136. [10] K. Grauman and T. Darrell, “The pyramid match kernel: discriminative classification with sets of image features,” in Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on, vol. 2, 2005.

You May Also Find These Documents Helpful

  • Good Essays

    When I first learned of the visual rhetorical analysis and began to read the accompanying instructions, not going to lie, I was a little worried. My first thought was “how am I supposed to talk, let alone, care enough about an image to write, at a minimum, 1250 words about it.” I mean they do say a picture speaks a 1000 words, but they don’t say anything about 1250. (This was all before we received the updated instructions, which by coincidence or not, changed the word count to 1000.) The next step in my process came when I initially saw the four images which we were presented with as options, and from the first glance I knew exactly the one I was going to write about. I chose the WWF’s representation of lungs and the environment because,…

    • 403 Words
    • 2 Pages
    Good Essays
  • Satisfactory Essays

    the similarity of medical reports is evaluated by calculating the semantic characteristics and syntactic similarity. It relies on an upgraded radiology-specific ontology to measure semantic similarity relationships between unstructured mammographic report concepts. While [7] improved the vector cosine similarity algorithm model which uses (is-a) relationships to measure the degree of similarity. For a fixed concept, after examining all the possible paths they arrived at the conclusion that the shortest similarity vector would be selected for each document then the cosine angle of each vector is calculated to determine the degree of similarity. testing has been done by comparing multiple clinical context reports using anatomy and imaging procedures…

    • 117 Words
    • 1 Page
    Satisfactory Essays
  • Good Essays

    In his work Bundesen discusses 3 types of perceptual categories, namely a color category, a shape category and a location category \cite{tva}. Our image retrieval model is founded mainly on color analysis. Therefore, we can follow the notation proposed in \cite{tva} with the assumption that perceptual categories are represented solely by colors.…

    • 389 Words
    • 2 Pages
    Good Essays
  • Best Essays

    K.P.Soman, R.Loganathan and V.Ajay, Machine Learning with SVM and other Kernel methods. PHI Learning private Ltd., 2009.…

    • 3416 Words
    • 14 Pages
    Best Essays
  • Powerful Essays

    Face Recognition

    • 33246 Words
    • 133 Pages

    Preface IX Part 1 Chapter 1 Statistical Face Models & Classifiers A Review of Hidden Markov Models in Face Recognition 3 Claudia Iancu and Peter M. Corcoran GMM vs SVM for Face…

    • 33246 Words
    • 133 Pages
    Powerful Essays
  • Powerful Essays

    unit 1

    • 12040 Words
    • 49 Pages

    Multimedia Technology Multimedia Technology By: Vishal Bharti Assistant Professor(CSE) Unit‐1: Basics of Multimedia Technology h l Multimedia: The term multimedia comes from the Latin words “multus” which means “numerous” and “Media” means “middle” or centre. In other words we can say that multimedia means “Multiple Intermediaries” or “Multiple Means”.…

    • 12040 Words
    • 49 Pages
    Powerful Essays
  • Powerful Essays

    H.3.3 [Information Search and Retrieval]: Retrieval models. J.4 [Social and Behavioral Sciences]: Sociology. Algorithms, Experimentation. Mobile Phone Data, Semantic Label, Trajectory Data Analysis.…

    • 1498 Words
    • 6 Pages
    Powerful Essays
  • Satisfactory Essays

    Content-Based Image Retrieval (CBIR) allows to automatically extracting target images according to objective visual contents of the image itself. Representation of visual features and similarity match are important issues in CBIR. In this paper a novel CBIR method is proposed by exploit the wavelets which represent the visual feature. We use Haar and D4 wavelet to decompose color images into multilevel scale and wavelet coefficients, with which we perform image feature extraction and similarity match by means of F-norm theory. Furthermore, we also provide a progressive image retrieval strategy to achieve flexible CBIR. We tested five categories of color images in the experiments. The retrieval performance of D4 and Haar wavelet is compared with wavelet histograms in terms of recall rate and retrieval speed. Experiment results reflect the importance of wavelets in CBIR and F-norm theory along with progressive retrieval strategy achieves efficient retrieval.…

    • 3812 Words
    • 16 Pages
    Satisfactory Essays
  • Better Essays

    A histogram of a color image can be a useful representation of that image for the purpose of image retrieval and object recognition. A histogram counts the number of pixels of each kind and can be rapidly created by reading each image pixel just once and incrementing the appropriate bin of histogram. The intersection of…

    • 799 Words
    • 4 Pages
    Better Essays
  • Good Essays

    There are different parameters of SVM that can be considered for the purpose of optimizing the accuracy of classification by the SVM. The first parameter that has been considered in these experiments is the type of kernel…

    • 2132 Words
    • 9 Pages
    Good Essays
  • Good Essays

    An image is split into smallest regions that LBP histograms are extracted and then concatenated in to a single feature vector. This vector forms an efficient representation of the face area and can be used to measure similarities between images. LBP features are effective and efficient for facial expression recognition. It will take much time. It is easy to implement.…

    • 803 Words
    • 4 Pages
    Good Essays
  • Good Essays

    Face detection

    • 3231 Words
    • 13 Pages

    with upright frontal faces, several systems have been developed that are able to detect faces fairly accurately with in-plane…

    • 3231 Words
    • 13 Pages
    Good Essays
  • Good Essays

    Multimedia and Graphics

    • 742 Words
    • 3 Pages

     Exhibit some change over time  e.g., video, animation and sound  Presentation of media are usually supplied with player controls: start, stop and pause…

    • 742 Words
    • 3 Pages
    Good Essays
  • Good Essays

    image compression

    • 315 Words
    • 1 Page

    Compressing compound images with a single algorithm that simultaneously meets the requirements for text, image and graphics has been elusive and thus requires new algorithms that can competently reduce the file size without degrading the quality. The compound image compression performance basically depends on the segmentation result. A segmentation process is used where regions of similar data types are grouped together. After successful segmentation existing techniques that best suits each data type can be used to achieve best compression results [4]. Segmentation algorithms for compound image compression are normally categorized as block based, object based and layer based segmentation. Each and every method has its own merits and demerits. Most of the recent researches in this field are mainly based on either layer based or block based. In object based method a page is divided into regions, where each region follows exact object boundaries. An object may be a photograph, a graphical object, a letter, etc. The main drawback of this method is its complexity. In layer-based method, a page is divided into rectangular layers [1]. Most layered coding algorithms use the standard three layer mixed raster content representation [5]. Some traditional compressors like DjVu [6], Digipaper [7] and JPEG 2000 [8] are also available.…

    • 315 Words
    • 1 Page
    Good Essays
  • Good Essays

    Multimedia

    • 2431 Words
    • 10 Pages

    Multimedia may be broadly divided into linear and non-linear categories. Linear active content progresses often without any navigational control for the viewer such as a cinema presentation. Non-linear uses interactivity to control progress as with a video game or self-paced computer based training. Hypermedia is an example of non-linear content.…

    • 2431 Words
    • 10 Pages
    Good Essays