ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models

A.I. Black Guy, May 28, 2024

Modern diffusion-based image generative models have made significant progress and have become a promising way to enrich training data for object detection. However, generation quality and controllability remain limited for complex scenes that contain objects of multiple classes or dense objects with occlusions. This paper presents ODGEN, a novel method to generate high-quality images conditioned on bounding boxes, thereby facilitating data synthesis for object detection. Given a domain-specific object detection dataset, we first fine-tune a pre-trained diffusion model on both cropped foreground objects and entire images to fit the target distributions. We then propose to control the diffusion model using synthesized visual prompts with spatial constraints and object-wise textual descriptions. ODGEN exhibits robustness in handling complex scenes and specific domains. Further, we design a dataset synthesis pipeline and evaluate ODGEN on 7 domain-specific benchmarks to demonstrate its effectiveness. Adding training data generated by ODGEN improves mAP@.50:.95 by up to 25.3% with object detectors such as YOLOv5 and YOLOv7, outperforming prior controllable generative methods. In addition, we design an evaluation protocol based on COCO-2014 to validate ODGEN in general domains and observe an advantage of up to 5.6% in mAP@.50:.95 over existing methods.
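
For readers unfamiliar with the metric quoted above, here is a minimal sketch of the idea behind mAP@.50:.95. It is not taken from the ODGEN paper or its evaluation code. It assumes the usual COCO-style convention, in which average precision is averaged over ten intersection-over-union (IoU) thresholds from 0.50 to 0.95 in steps of 0.05. The function names `iou` and `thresholds_passed` are hypothetical helpers introduced only for illustration, and boxes are assumed to be `(x_min, y_min, x_max, y_max)` tuples.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x_min, y_min, x_max, y_max) boxes."""
    # Corners of the overlapping rectangle (may be empty).
    ix_min = max(box_a[0], box_b[0])
    iy_min = max(box_a[1], box_b[1])
    ix_max = min(box_a[2], box_b[2])
    iy_max = min(box_a[3], box_b[3])

    # Clamp to zero so non-overlapping boxes give zero intersection.
    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Assumed COCO-style thresholds: 0.50, 0.55, ..., 0.95.
IOU_THRESHOLDS = [round(0.50 + 0.05 * i, 2) for i in range(10)]


def thresholds_passed(pred_box, gt_box):
    """Hypothetical helper: thresholds at which a prediction counts as a match."""
    score = iou(pred_box, gt_box)
    return [t for t in IOU_THRESHOLDS if score >= t]


if __name__ == "__main__":
    predicted = (0, 0, 100, 72)
    ground_truth = (0, 0, 100, 100)
    print(iou(predicted, ground_truth))
    print(thresholds_passed(predicted, ground_truth))
```

A full mAP computation also ranks detections by confidence and integrates a precision-recall curve per class. The sketch only shows why a loosely localized box can count as correct at the 0.50 threshold yet fail at stricter ones, which is what makes mAP@.50:.95 more demanding than mAP@.50.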

Figure: The proposed ODGEN enables controllable image generation from bounding boxes and text prompts. It can generate high-quality data for complex scenes, encompassing multiple categories, dense objects, and occlusions, which can be used to enrich the training data for object detection.
