https://github.com/ncsoft/rescue_drone_dataset
인명 구조용 드론을 위한 음성 화자 인지 기술 데이터셋
https://github.com/ncsoft/rescue_drone_dataset
Last synced: 5 months ago
JSON representation
인명 구조용 드론을 위한 음성 화자 인지 기술 데이터셋
- Host: GitHub
- URL: https://github.com/ncsoft/rescue_drone_dataset
- Owner: ncsoft
- License: cc-by-4.0
- Created: 2022-12-27T07:38:12.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2023-01-02T01:27:39.000Z (over 3 years ago)
- Last Synced: 2025-03-02T07:49:29.951Z (over 1 year ago)
- Homepage:
- Size: 9.77 KB
- Stars: 26
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
README
# rescue_drone_dataset
We have captured ten different virtual indoor rescue scenarios by a drone. The video and audio sets were captured on a drone three times for each sequence. We dressed mannequins as firefighters, rescuers, medical staff, and the general person(male, female, child). We arranged mannequins in a different pose for each scenario, and the audio of the rescue request voice was configured differently(male, female, child).
We have captured 30 sets of data for multi-object detection, crowd counting, optical character recognition, speaker recognition, etc. Images were composed of 1920x1080 resolution, and voice data was acquired by a 7-channel microphone (16Khz sampling rate and 1024 chunk size).
This opensource is a collaboration between NCSOFT, UVify Co., Ltd., Sogang University and mpWAV Inc. Additional information about the dataset can be found at the URL below.
URL: https://github.com/uvify/rescue_drone_dataset