From 324f8ce2807c36cf7884171f66807670ef21814d Mon Sep 17 00:00:00 2001
From: zcr <cgzhou@aidlab.hk>
Date: Tue, 17 Mar 2026 11:31:45 +0800
Subject: [PATCH] 1

---
 DATASET.md | 239 +++++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 239 insertions(+)
 create mode 100644 DATASET.md
diff --git a/DATASET.md b/DATASET.md
new file mode 100644
index 0000000..bf21580
--- /dev/null
+++ b/DATASET.md
@@ -0,0 +1,239 @@
+# TRELLIS-500K
+
+TRELLIS-500K is a dataset of 500K 3D assets curated from [Objaverse(XL)](https://objaverse.allenai.org/), [ABO](https://amazon-berkeley-objects.s3.amazonaws.com/index.html), [3D-FUTURE](https://tianchi.aliyun.com/specials/promotion/alibaba-3d-future), [HSSD](https://huggingface.co/datasets/hssd/hssd-models), and [Toys4k](https://github.com/rehg-lab/lowshot-shapebias/tree/main/toys4k), filtered based on aesthetic scores.
+This dataset serves for 3D generation tasks.
+
+The dataset is provided as csv files containing the 3D assets' metadata.
+
+## Dataset Statistics
+
+The following table summarizes the dataset's filtering and composition:
+
+***NOTE: Some of the 3D assets lack text captions. Please filter out such assets if captions are required.***
+| Source | Aesthetic Score Threshold | Filtered Size | With Captions |
+|:-:|:-:|:-:|:-:|
+| ObjaverseXL (sketchfab) | 5.5 | 168307 | 167638 |
+| ObjaverseXL (github) | 5.5 | 311843 | 306790 |
+| ABO | 4.5 | 4485 | 4390 |
+| 3D-FUTURE | 4.5 | 9472 | 9291 |
+| HSSD | 4.5 | 6670 | 6661 |
+| All (training set) | - | 500777 | 494770 |
+| Toys4k (evaluation set) | 4.5 | 3229 | 3180 |
+
+## Dataset Location
+
+The dataset is hosted on Hugging Face Datasets. You can preview the dataset at
+
+[https://huggingface.co/datasets/JeffreyXiang/TRELLIS-500K](https://huggingface.co/datasets/JeffreyXiang/TRELLIS-500K)
+
+There is no need to download the csv files manually. We provide toolkits to load and prepare the dataset.
+
+## Dataset Toolkits
+
+We provide [toolkits](dataset_toolkits) for data preparation.
+
+### Step 1: Install Dependencies
+
+```
+. ./dataset_toolkits/setup.sh
+```
+
+### Step 2: Load Metadata
+
+First, we need to load the metadata of the dataset.
+
+```
+python dataset_toolkits/build_metadata.py <SUBSET> --output_dir <OUTPUT_DIR> [--source <SOURCE>]
+```
+
+- `SUBSET`: The subset of the dataset to load. Options are `ObjaverseXL`, `ABO`, `3D-FUTURE`, `HSSD`, and `Toys4k`.
+- `OUTPUT_DIR`: The directory to save the data.
+- `SOURCE`: Required if `SUBSET` is `ObjaverseXL`. Options are `sketchfab` and `github`.
+
+For example, to load the metadata of the ObjaverseXL (sketchfab) subset and save it to `datasets/ObjaverseXL_sketchfab`, we can run:
+
+```
+python dataset_toolkits/build_metadata.py ObjaverseXL --source sketchfab --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+### Step 3: Download Data
+
+Next, we need to download the 3D assets.
+
+```
+python dataset_toolkits/download.py <SUBSET> --output_dir <OUTPUT_DIR> [--rank <RANK> --world_size <WORLD_SIZE>]
+```
+
+- `SUBSET`: The subset of the dataset to download. Options are `ObjaverseXL`, `ABO`, `3D-FUTURE`, `HSSD`, and `Toys4k`.
+- `OUTPUT_DIR`: The directory to save the data.
+
+You can also specify the `RANK` and `WORLD_SIZE` of the current process if you are using multiple nodes for data preparation.
+
+For example, to download the ObjaverseXL (sketchfab) subset and save it to `datasets/ObjaverseXL_sketchfab`, we can run: 
+
+***NOTE: The example command below sets a large `WORLD_SIZE` for demonstration purposes. Only a small portion of the dataset will be downloaded.***
+
+```
+python dataset_toolkits/download.py ObjaverseXL --output_dir datasets/ObjaverseXL_sketchfab --world_size 160000
+```
+
+Some datasets may require interactive login to Hugging Face or manual downloading. Please follow the instructions given by the toolkits.
+
+After downloading, update the metadata file with:
+
+```
+python dataset_toolkits/build_metadata.py ObjaverseXL --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+### Step 4: Render Multiview Images (& Calculate Aesthetic Scores)
+
+Multiview images can be rendered with:
+
+```
+python dataset_toolkits/render.py <SUBSET> --output_dir <OUTPUT_DIR> [--num_views <NUM_VIEWS>] [--rank <RANK> --world_size <WORLD_SIZE>]
+```
+
+- `SUBSET`: The subset of the dataset to render. Options are `ObjaverseXL`, `ABO`, `3D-FUTURE`, `HSSD`, and `Toys4k`.
+- `OUTPUT_DIR`: The directory to save the data.
+- `NUM_VIEWS`: The number of views to render. Default is 150.
+- `RANK` and `WORLD_SIZE`: Multi-node configuration.
+
+For example, to render the ObjaverseXL (sketchfab) subset and save it to `datasets/ObjaverseXL_sketchfab`, we can run:
+
+```
+python dataset_toolkits/render.py ObjaverseXL --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+(Optional) If you want to calculate the aesthetic scores of your own rendered datasets, you can use the following command:
+
+```
+python dataset_toolkits/calculate_aesthetic_scores.py --output_dir <OUTPUT_DIR> [--rank <RANK> --world_size <WORLD_SIZE>]
+```
+- `OUTPUT_DIR`: The directory to save the data.
+- `RANK` and `WORLD_SIZE`: Multi-node configuration.
+
+Don't forget to update the metadata file with:
+
+```
+python dataset_toolkits/build_metadata.py ObjaverseXL --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+### Step 5: Voxelize 3D Models
+
+We can voxelize the 3D models with:
+
+```
+python dataset_toolkits/voxelize.py <SUBSET> --output_dir <OUTPUT_DIR> [--rank <RANK> --world_size <WORLD_SIZE>]
+```
+
+- `SUBSET`: The subset of the dataset to voxelize. Options are `ObjaverseXL`, `ABO`, `3D-FUTURE`, `HSSD`, and `Toys4k`.
+- `OUTPUT_DIR`: The directory to save the data.
+- `RANK` and `WORLD_SIZE`: Multi-node configuration.
+
+For example, to voxelize the ObjaverseXL (sketchfab) subset and save it to `datasets/ObjaverseXL_sketchfab`, we can run:
+```
+python dataset_toolkits/voxelize.py ObjaverseXL --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+Then update the metadata file with:
+
+```
+python dataset_toolkits/build_metadata.py ObjaverseXL --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+### Step 6: Extract DINO Features
+
+To prepare the training data for SLat VAE, we need to extract DINO features from multiview images and aggregate them into sparse voxel grids.
+
+```
+python dataset_toolkits/extract_features.py --output_dir <OUTPUT_DIR> [--rank <RANK> --world_size <WORLD_SIZE>]
+```
+
+- `OUTPUT_DIR`: The directory to save the data.
+- `RANK` and `WORLD_SIZE`: Multi-node configuration.
+
+
+For example, to extract DINO features from the ObjaverseXL (sketchfab) subset and save it to `datasets/ObjaverseXL_sketchfab`, we can run:
+
+```
+python dataset_toolkits/extract_feature.py --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+Then update the metadata file with:
+
+```
+python dataset_toolkits/build_metadata.py ObjaverseXL --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+### Step 7: Encode Sparse Structures
+
+Encoding the sparse structures into latents to train the first stage generator:
+
+```
+python dataset_toolkits/encode_ss_latent.py --output_dir <OUTPUT_DIR> [--rank <RANK> --world_size <WORLD_SIZE>]
+```
+
+- `OUTPUT_DIR`: The directory to save the data.
+- `RANK` and `WORLD_SIZE`: Multi-node configuration.
+
+For example, to encode the sparse structures into latents for the ObjaverseXL (sketchfab) subset and save it to `datasets/ObjaverseXL_sketchfab`, we can run:
+
+```
+python dataset_toolkits/encode_ss_latent.py --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+Then update the metadata file with:
+
+```
+python dataset_toolkits/build_metadata.py ObjaverseXL --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+### Step 8: Encode SLat
+
+Encoding SLat for second stage generator training:
+
+```
+python dataset_toolkits/encode_latent.py --output_dir <OUTPUT_DIR> [--rank <RANK> --world_size <WORLD_SIZE>]
+```
+
+- `OUTPUT_DIR`: The directory to save the data.
+- `RANK` and `WORLD_SIZE`: Multi-node configuration.
+
+For example, to encode SLat for the ObjaverseXL (sketchfab) subset and save it to `datasets/ObjaverseXL_sketchfab`, we can run:
+
+```
+python dataset_toolkits/encode_latent.py --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+Then update the metadata file with:
+
+```
+python dataset_toolkits/build_metadata.py ObjaverseXL --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+### Step 9: Render Image Conditions
+
+To train the image conditioned generator, we need to render image conditions with augmented views.
+
+```
+python dataset_toolkits/render_cond.py <SUBSET> --output_dir <OUTPUT_DIR> [--num_views <NUM_VIEWS>] [--rank <RANK> --world_size <WORLD_SIZE>]
+```
+
+- `SUBSET`: The subset of the dataset to render. Options are `ObjaverseXL`, `ABO`, `3D-FUTURE`, `HSSD`, and `Toys4k`.
+- `OUTPUT_DIR`: The directory to save the data.
+- `NUM_VIEWS`: The number of views to render. Default is 24.
+- `RANK` and `WORLD_SIZE`: Multi-node configuration.
+
+For example, to render image conditions for the ObjaverseXL (sketchfab) subset and save it to `datasets/ObjaverseXL_sketchfab`, we can run:
+
+```
+python dataset_toolkits/render_cond.py ObjaverseXL --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+Then update the metadata file with:
+
+```
+python dataset_toolkits/build_metadata.py ObjaverseXL --output_dir datasets/ObjaverseXL_sketchfab
+```
+
+