Logo Kérwá
 

Size does matter: Overcoming limitations during training when using a feature pyramid network

dc.creatorFallas Moya, Fabián
dc.creatorGonzález Hernández, Manfred
dc.creatorSadovnik, Amir
dc.date.accessioned2025-06-20T16:31:33Z
dc.date.issued2022-01-25
dc.description.abstractState-of-the-art object detectors need to be trained with a wide variety of data in order to perform well in real-world problems. Training-data-diversity is very important to achieve good generalization. However, there are scenarios where we have training data with certain limitations. One such scenario is when the objects of the testing set have a different size (discrepancy) from the objects used during training. Another scenario is when we have high-resolution images with a dimension that is not supported by the model. To address these problems, we propose a novel pipeline that is able to handle high-resolution images by cropping the original image into sub-images and put them back in the end. Also, in the case of the discrepancy of object sizes, we propose two different techniques based on scaling the image up and down in order to have an acceptable performance. In addition, we also use the information from the Feature Pyramid Network to remove false-positives. Our proposed methods overcome state-of-the-art data augmentation policies and our models can generalize to different object sizes even though limited data is provided.
dc.description.procedenceUCR::Sedes Regionales::Sede del Atlántico
dc.description.procedenceUCR::Vicerrectoría de Docencia::Ingeniería::Facultad de Ingeniería::Escuela de Ciencias de la Computación e Informática
dc.description.sponsorshipUniversidad de Costa Rica/[510-B9-453]/UCR/Costa Rica
dc.identifier.codproyecto510-B9-453
dc.identifier.doihttps://doi.org/10.1109/ICMLA52953.2021.00249
dc.identifier.isbn978-1-6654-4337-1
dc.identifier.isbn978-1-6654-4337-1
dc.identifier.urihttps://hdl.handle.net/10669/102345
dc.language.isoeng
dc.source2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA). Institute of Electrical and Electronics Engineers
dc.subjectdata augmentation
dc.subjectobject detection
dc.subjectOD
dc.subjectdrone imaging
dc.subjectfeature pyramid network
dc.titleSize does matter: Overcoming limitations during training when using a feature pyramid network
dc.typecomunicación de congreso

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Artículo_SizeDoesMatter.pdf
Size:
1.39 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
3.5 KB
Format:
Item-specific license agreed upon to submission
Description: