OpenBreastUS is a large-scale wave equation dataset designed to bridge the gap between theoretical equations and practical imaging applications, consisting of 8,000 breast phantoms and 16,384,000 wavefields. To represent the distribution of diverse human breast types, the dataset is divided into four groups, each corresponding to a specific breast density type: heterogeneous (HET), fibroglandular (FIB), all fatty (FAT), and extremely dense (EXD).
Breast Type | Frequency (MHz) | Phantoms | Storage |
---|---|---|---|
Heterogeneous (HET) | 0.30-0.65 | 2000 | 7.2TB |
Fibroglandular (FIB) | 0.30-0.65 | 3000 | 10.8TB |
Fatty (FAT) | 0.30-0.65 | 2000 | 7.2TB |
Extremely Dense (EXD) | 0.30-0.65 | 1000 | 3.6TB |
The wavefields are simulated using parameters from a real annular USCT system. We focus on 8 frequencies between 300 kHz and 650 kHz, sampled at 50 kHz intervals, resulting in ROIs with approximately 50 to 100 wavenumbers.
>> run split_data.m
output:
your_project_path/organ_speed/train/train_xx.mat
your_project_path/organ_speed/test/test_xx.mat
This is the runtime interface example. Configure the following parameters to control the data generation process. Set the speed path to your output dir from the last step, and the system will generate data and detailed log files in your specified output directory.
16-core CPU; MATLAB 2020b+; 64GB RAM