Deep Image Captioning: An Overview (CROSBI ID 677330)
Conference paper in proceedings | original scientific paper | international peer review
Authorship details
Hrga, Ingrid ; Ivašić-Kos, Marina
English
Deep Image Captioning: An Overview
Image captioning is a process of automatically describing an image with one or more natural language sentences. In recent years, image captioning has witnessed rapid progress, from initial template-based models to the current ones, based on deep neural networks. This paper gives an overview of issues and recent image captioning research, with a particular emphasis on models that use the deep encoder-decoder architecture. We discuss the advantages and disadvantages of different approaches, along with reviewing some of the most commonly used evaluation metrics and datasets.
image captioning ; encoder-decoder ; attention mechanism ; deep neural networks
Contribution details
1179-1184.
2019.
published
10.23919/MIPRO.2019.8756821
Host publication details
Proceedings of 42nd International ICT Convention – MIPRO 2019
Biljanović, Petar (ed.).
Opatija: Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO
1847-3946
Conference details
MIPRO 2019
lecture
20.05.2019-24.05.2019
Opatija, Croatia
Research fields
Information and Communication Sciences, Computing