Detail publikačního výsledku

Cascaded Stripe Memory Engines for Multi-Scale Object Detection in FPGA

MUSIL, P.; JURÁNEK, R.; MUSIL, M.; ZEMČÍK, P.

Originální název

Cascaded Stripe Memory Engines for Multi-Scale Object Detection in FPGA

Anglický název

Cascaded Stripe Memory Engines for Multi-Scale Object Detection in FPGA

Druh

Článek WoS

Originální abstrakt

Object detection in embedded systems is important for many contemporary applications that involve vision and scene analysis. In this paper, we propose a novel architecture for object detection implemented in FPGA, based on the Stripe Memory Engine (SME), and point out shortcomings of existing architectures. SME processes a stream of image data so that it stores a narrow stripe of the input image and its scaled versions and uses a detector unit which is efficiently pipelined across multiple image positions within the SME. We show how to process images with up to 4K resolution at high framerates using cascades of SMEs. As a detector algorithm, the SMEs use boosted soft cascade with simple image features that require only pixel comparisons and look-up tables; therefore, they are well suitable for hardware implemenation. We describe the components of our architecture and compare it to several published works in several configurations. As an example, we implemented face detection and license plate detection applications that work with HD images (1280×720 pixels) running at over 60 frames per second on Xilinx Zynq platform. We analyzed their power consumption, evaluated the accuracy of our detectors, and compared them to Haar Cascades from OpenCV that are often used by other authors. We show that our detectors offer better accuracy as well as performance at lower power consumption.

Anglický abstrakt

Object detection in embedded systems is important for many contemporary applications that involve vision and scene analysis. In this paper, we propose a novel architecture for object detection implemented in FPGA, based on the Stripe Memory Engine (SME), and point out shortcomings of existing architectures. SME processes a stream of image data so that it stores a narrow stripe of the input image and its scaled versions and uses a detector unit which is efficiently pipelined across multiple image positions within the SME. We show how to process images with up to 4K resolution at high framerates using cascades of SMEs. As a detector algorithm, the SMEs use boosted soft cascade with simple image features that require only pixel comparisons and look-up tables; therefore, they are well suitable for hardware implemenation. We describe the components of our architecture and compare it to several published works in several configurations. As an example, we implemented face detection and license plate detection applications that work with HD images (1280×720 pixels) running at over 60 frames per second on Xilinx Zynq platform. We analyzed their power consumption, evaluated the accuracy of our detectors, and compared them to Haar Cascades from OpenCV that are often used by other authors. We show that our detectors offer better accuracy as well as performance at lower power consumption.

Klíčová slova

Object Detection, AdaBoost, WaldBoost, Acceleration, FPGA

Klíčová slova v angličtině

Object Detection, AdaBoost, WaldBoost, Acceleration, FPGA

Autoři

MUSIL, P.; JURÁNEK, R.; MUSIL, M.; ZEMČÍK, P.

Rok RIV

2020

Vydáno

01.01.2020

ISSN

1051-8215

Periodikum

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

Svazek

30

Číslo

1

Stát

Spojené státy americké

Strany od

267

Strany do

280

Strany počet

13

URL

BibTex

@article{BUT161413,
  author="Petr {Musil} and Roman {Juránek} and Martin {Musil} and Pavel {Zemčík}",
  title="Cascaded Stripe Memory Engines for Multi-Scale Object Detection in FPGA",
  journal="IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY",
  year="2020",
  volume="30",
  number="1",
  pages="267--280",
  doi="10.1109/TCSVT.2018.2886476",
  issn="1051-8215",
  url="https://ieeexplore.ieee.org/document/8573854"
}

Dokumenty