Please use this identifier to cite or link to this item:
https://hdl.handle.net/1889/4044
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Prati, Andrea | - |
dc.contributor.author | Magliani, Federico | - |
dc.date.accessioned | 2020-04-18T09:27:46Z | - |
dc.date.available | 2020-04-18T09:27:46Z | - |
dc.date.issued | 2020-03 | - |
dc.identifier.uri | http://hdl.handle.net/1889/4044 | - |
dc.description.abstract | The Content-Based Image Retrieval (CBIR) task is a computer vision problem. The growth of the digital images on the Internet allows to encourage the proposal of solution for this task more than before. The access to this huge quantity of data has allowed the creation of big datasets, that brought with them lots of new challenges. Briefly, the objective of the task is simply to retrieve and rank the similar images to the query one, called retrieval accuracy, that need to be as high as possible. Moreover, there are secondary targets as retrieval time and memory occupancy that need to be as low as possible. The problem is trivial for humans that simply execute this task through experience and semantic perception, but it is not so easy for a computer. This is known as semantic gap, which refers to the gap between low-level image pixels and high-level semantic concepts. Furthermore, the images may contain noisy patches (e.g. trees, person, cars, ...), be taken with different lightning conditions, viewpoints and resolution. In order to solve this problem it is crucial to develop algorithms and techniques with the objective of reducing the weight of the unnecessary patches of the images and that work well with a vast quantity of data. There are several applications of CBIR systems: libraries and museum applications, fashion application for the search of certain clothes, advanced electronic tourist guides. In this thesis a complete pipeline for the resolution of the CBIR problem is presented and then all the steps of the process are evaluated with a particular focus on CNN transfer learning, embeddings, large-scale retrieval and methods based on graphs as diffusion mechanism. All the methods presented are tested on several public image datasets in order to compare the final retrieval results. | it |
dc.language.iso | Italiano | it |
dc.publisher | Università degli Studi di Parma. Dipartimento di Ingegneria e architettura | it |
dc.relation.ispartofseries | Dottorato di ricerca in Tecnologie dell'informazione | it |
dc.rights | © Federico Magliani, 2020 | it |
dc.subject | content-based image retrieval | it |
dc.subject | LSH | it |
dc.subject | R-MAC+ | it |
dc.subject | locVLAD | it |
dc.subject | Bag of Indexes | it |
dc.subject | LSH kNN graph | it |
dc.title | Content-based image retrieval for visual big data analysis | it |
dc.type | Doctoral thesis | it |
dc.subject.miur | ING.INF./05 | it |
Appears in Collections: | Tecnologie dell'informazione. Tesi di dottorato |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
relazione-finale-schema.pdf Until 2100-01-01 | Relazione finale del Dottorato di Federico Magliani. Contiene riassunto attività scientifica, pubblicazioni svolte e attività svolte durante il phd (come revisione di articoli, tirocini all'estero, partecipazione a conferenze) | 64.89 kB | Adobe PDF | View/Open Request a copy |
TesiDottorato.pdf | Tesi di Dottorato di Federico Magliani su Content-Based Image Retrieval | 4.67 MB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License