preprintarXiv (Cornell University)Oct 23, 2024GREEN OA

YOLOv11: An Overview of the Key Architectural Enhancements

Indexed inarxivdatacite

Abstract

This study presents an architectural analysis of YOLOv11, the latest iteration in the YOLO (You Only Look Once) series of object detection models. We examine the models architectural innovations, including the introduction of the C3k2 (Cross Stage Partial with kernel size 2) block, SPPF (Spatial Pyramid Pooling - Fast), and C2PSA (Convolutional block with Parallel Spatial Attention) components, which contribute in improving the models performance in several ways such as enhanced feature extraction. The paper explores YOLOv11's expanded capabilities across various computer vision tasks, including object detection, instance segmentation, pose estimation, and oriented object detection (OBB). We review the model's…

Citation impact

488
total citations
FWCI
Percentile
References
0
Citations per year

Authors

2

Topics & keywords

Keywords
  • Key (lock)
  • Architectural engineering
  • Computer science
  • Computer architecture
  • Engineering
  • Computer security
No related works found for this paper.