首页 > 专利技术

目标检测论文（尤其针对一些小目标的可能改进方法）

⽬标检测论⽂（尤其针对⼀些⼩⽬标的可能改进⽅法）

2、

基于anchor设计原理解决⼩脸检测不到的问题。

7、

a two-stage cascaded face detection framework：

(1) a Multi-Path Region Proposal Network(MP-RPN)，利⽤3个平⾏特征图的输出预测不同尺度的候选⼈脸区域，嵌有带有上采样过滤的卷积层和新提出的产⽣“难”例采样层。

(2) a Boosted Forests classifier，利⽤候选⼈脸区域内的深层⾯部特征和周围更⼤区域的上下⽂特征，⼤⼤减少 hard negative samples.

8、

先对图⽚上的⼈脸进⾏尺度估计，再在特定尺度上进⾏⼈脸检测（使⽤RPN，只使⽤⼀种anchor，且每次只检测⼀张脸）。不⽤在各个尺度下对⼈脸检测，因此速度较快。

9、

uvlo电路不是使⽤多个CNN⽹络来级联的，⽽是使⽤⼀个CNN中不同⽹络层来做级联。CNN⽹络的前⼏层完成简单的⼈脸检测，后⾯的⽹络完成难度较⼤的⼈脸检测，采⽤data routing机制来使不同的卷积层由不同类型的样本来训练，关注于去掉不同类型的⾮⼈脸样本。同时使⽤body part localization 来辅助⼈脸检测。

10、

核⼼⽅法类似SSD。在⽹络不同阶段引出分⽀检测对应范围的⼈脸。训练时针对不同分组只⽤对应尺度的样本进⾏训练。

11、

基于FCNs的3层级联结构。It first proposes the approximate locations where the faces may be, then aims to find the accurate location by zooming on to the faces. Each level of the FCN cascade is a multi-scale fully-convolutional network, which generates scores at different locations and in different scales. A score map is generated after each FCN stage. Probable regions of face are selected and fed to the next stage. The number of proposals is decreased after each level, and the areas of regions are decreased to more precisely fit the face.

12、

对Faster RCNN的⼀些改进策略： feature concatenation, hard negative mining, multi-scale training, model pretraining, and proper calibration of key parameters.

胎盘提取液13、

对Faster RCNN改进：new multi-task loss function design, , and multi-scale training strategy

14、

multi-scale training, multi-scale testing, light-designed RCNN, keep the small proposals at training and testing stage, directly select top-ranked proposals (e.g., 6000) without NMS in the RPN stage for R-CNN, a vote-based NMS ensemble strategy.

15、

银行复点机propose a context pyramid maxout mechanism for anchor cascade。⼤⼤减少计算量和提⾼检测精度。同时对于训练⼩规模模型也有很⾼的检测精度。

16、

解决⼤尺度变化问题。提出新算法SFace：整合了anchor-based methods(类似RetinaNet)和anchor-free based methods(类似UnitBox)。

-----------------------------------------------------------------------

2、

可看成改版版本的Image Pyramid。

分析了⼩尺度与预训练模型尺度之间的关系, 提出了Scale Normalization for Image Pyramids (SNIP)：在训练中，每次只回传那些⼤⼩在⼀个预先指定范围内的proposal的gradient，⽽忽略掉过⼤或者过⼩的proposal；在测试中，建⽴⼤⼩不同的Image Pyramid，在每张图上都运⾏这样⼀个detector，同样只保留那些⼤⼩在指定范围之内的输出结果，最终在⼀起NMS。这样就可以保证⽹络总是在同样scale的物体上训练，也就是标题中Scale Normalized的意思。

4、

在SSD⽹络基础上，增加了语义分割分⽀和全局激活模块。前者增加低层检测特征，后者通过学习特征通道和⽬标类别的语义关系来进⾏⾼层⽬标检测特征。

7、

large images are broken into small patches as input to a Small Object-Sensitive-CNN (SOS-CNN) modified from a Single Shot Multibox Detector (SSD) framework with a VGG-16 network as the base network to produce patch-level object detection results. Scale invariance is achieved by applying the SOS-CNN on an image pyramid. Then, image-level object detection is obtained by projecting all the patch-level detection results to the image at the original scale.

8、

P-GAN将⼩⽬标的特征映射到相似的⼤⽬标特征上来缩⼩差别，便能将⼩⽬标⾜够近似到⼤⽬标来欺骗判别器，达到⼩⽬标检测的⽬的。

9、

特征⾦字塔⽹络。

10、

提出⼀个对于⼩⽬标检测的标准的端到端的多任务⽣成对抗⽹络（MTGAN），适⽤于任何已有的检测器。In the MTGAN, the generator network produces super-resolved images and the multi-task discriminator network is introduced to distinguish

the real high-resolution images from fake ones, predict object categories, and refine bounding boxes, simultaneously. More importantly, the classification and regression losses are back-propagated to further guide the generator network to produce super-resolved images for easier classification and better localization.

触控开关

11、

rfid标签生产当前特征⾦字塔的设计在如何整合不同尺度的语义信息⽅⾯仍然不够⾼效。本⽂把特征⾦字塔转换为特征的再组合过程，创造性地提出了⼀种⾼度⾮线性但是计算快速的结构将底层表⽰和⾼层语义特征进⾏整合。该⽹络由两个模块组成：全局注意⼒和局部再组合。这两个模块分布能全局和局部地去在不同的空间和尺度上提取任务相关的特征。重要的是，这两个模块具有轻量级、可嵌⼊和可端到端训练的优点。

12、

使⽤SPP模块通过扩⼤⽹络宽度⽽不是增加深度来⽣成⾦字塔形特征图。提出MSCA模块有效地组合了不同规模的上下⽂信息。

13、

提出了Scale Aware Network (SAN)，将来⾃不同尺度的卷积特征映射到尺度不变的⼦空间，并设计了⼀种独特的学习⽅法，纯粹考虑了没有空间信息的渠道之间的关系。所提出的SAN减少了标度空间中的特征差异并提⾼了检测精度。

14、

介绍了⼀种⽣成anchor proposals的改进建议，并对Faster R-CNN进⾏修改，利⽤较⾼分辨率的⼩⽬标的feature maps。

15、口香糖电池

we introduce an improved scheme for generating anchor proposals and propose a modification to Faster R-CNN which leverages higher-resolution feature maps for small objects.

16、

提出Scale-aware Pixel-wise Object Proposal（SPOP）⽹络，可以⽣成具有⾼召回率和平均最佳重叠（ABO）的proposals，即使对于⼩⽬标也是如此。另外，引⼊了⼀个类似分段的像素定位⽹络来密集预测每个像素的对象坐标，并开发了⼀种尺度感知对象定位策略，该策略将来⾃⼤尺⼨和⼩尺⼨⽹络的预测与加权机制相结合，以提⾼各种对象尺⼨的坐标预测精度。

本文发布于:2024-09-21 14:27:57，感谢您对本站的认可！

本文链接：https://www.17tex.com/tex/3/239045.html

上一篇：语音合成技术（深度学习方法简介）

下一篇：抗纤保肝汤慢性乙型肝炎肝纤维化的临床分析

标签：特征尺度检测训练模块提出预测

留言与评论（共有 0 条评论）