Size does matter: Overcoming limitations during training when using a feature pyramid network
dc.creator | Fallas Moya, Fabián | |
dc.creator | González Hernández, Manfred | |
dc.creator | Sadovnik, Amir | |
dc.date.accessioned | 2025-06-20T16:31:33Z | |
dc.date.issued | 2022-01-25 | |
dc.description.abstract | State-of-the-art object detectors need to be trained with a wide variety of data in order to perform well in real-world problems. Training-data-diversity is very important to achieve good generalization. However, there are scenarios where we have training data with certain limitations. One such scenario is when the objects of the testing set have a different size (discrepancy) from the objects used during training. Another scenario is when we have high-resolution images with a dimension that is not supported by the model. To address these problems, we propose a novel pipeline that is able to handle high-resolution images by cropping the original image into sub-images and put them back in the end. Also, in the case of the discrepancy of object sizes, we propose two different techniques based on scaling the image up and down in order to have an acceptable performance. In addition, we also use the information from the Feature Pyramid Network to remove false-positives. Our proposed methods overcome state-of-the-art data augmentation policies and our models can generalize to different object sizes even though limited data is provided. | |
dc.description.procedence | UCR::Sedes Regionales::Sede del Atlántico | |
dc.description.procedence | UCR::Vicerrectoría de Docencia::Ingeniería::Facultad de Ingeniería::Escuela de Ciencias de la Computación e Informática | |
dc.description.sponsorship | Universidad de Costa Rica/[510-B9-453]/UCR/Costa Rica | |
dc.identifier.codproyecto | 510-B9-453 | |
dc.identifier.doi | https://doi.org/10.1109/ICMLA52953.2021.00249 | |
dc.identifier.isbn | 978-1-6654-4337-1 | |
dc.identifier.isbn | 978-1-6654-4337-1 | |
dc.identifier.uri | https://hdl.handle.net/10669/102345 | |
dc.language.iso | eng | |
dc.source | 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA). Institute of Electrical and Electronics Engineers | |
dc.subject | data augmentation | |
dc.subject | object detection | |
dc.subject | OD | |
dc.subject | drone imaging | |
dc.subject | feature pyramid network | |
dc.title | Size does matter: Overcoming limitations during training when using a feature pyramid network | |
dc.type | comunicación de congreso |