Bucket: TorridFish/anypoint-datasets

2.61 TB

33,190 files

Updated 9 days ago


Name	Size	Uploaded	Xet hash (item count for folders)
ScanRefer		21 days ago	6 items
Scannet		10 days ago	1,513 items
checkpoints		13 days ago	78 items
objaverse_sonata_features		21 days ago	200 items
precomputed_voxel_1_5		18 days ago	+10k items
precomputed_voxel_1_5_eval		13 days ago	285 items
3rscan_sonata_feat.tar.gz	49.7 GB xet	21 days ago	dedec2da
GPT_dataset_qwen25_final_test.json	5.91 MB xet	21 days ago	d709b9f8
GPT_dataset_qwen25_final_train.json	68.3 MB xet	21 days ago	7a2d8e86
GPT_dataset_qwen3vl_final_test.json	6.58 MB xet	21 days ago	a6e7a51c
GPT_dataset_qwen3vl_final_train.json	79.3 MB xet	21 days ago	761261e8
README.md	3.76 kB xet	21 days ago	b4deb0df
checkpoints	469 MB xet	21 days ago	1a9a5000
dense_captioning_train.json	155 MB xet	21 days ago	f434b8e7
dense_captioning_val.json	40.2 MB xet	21 days ago	b14be7a4
global_feat.tar.gz	266 GB xet	21 days ago	a22a0570
grounding_multi3drefer_train.json	975 MB xet	21 days ago	cab5acb0
grounding_multi3drefer_val.json	130 MB xet	21 days ago	25e0ece2
grounding_multi3drefer_val_iou25.json	130 MB xet	21 days ago	d9df6440
grounding_scanrefer_train.json	1.63 GB xet	21 days ago	9f7761cb
leo_proposal_mappings.tar.gz	222 kB xet	21 days ago	0ae6787e
leo_proposals.tar.gz	50.4 MB xet	21 days ago	4a2612e4
local_feat.tar.gz	65.5 GB xet	21 days ago	da7f3ca5
multi3drefer_train.json	198 MB xet	21 days ago	18fabc37
multi3drefer_val_gt.json	51.5 MB xet	21 days ago	b81113cc
multi3drefer_val_samelabel.json	14.9 MB xet	21 days ago	7aec0eba
new_dense_captioning_train.json	155 MB xet	21 days ago	79dd32b8
new_grounding_multi3drefer_train.json	1.02 GB xet	21 days ago	a69c4d41
new_grounding_multi3drefer_val.json	132 MB xet	21 days ago	9aa4b4a0
new_grounding_multi3drefer_val_iou25.json	132 MB xet	21 days ago	3c836309
new_grounding_scanrefer_train.json	1.71 GB xet	21 days ago	c51d1b60
new_norep_dense_captioning_train.json	31.1 MB xet	9 days ago	4b6e7bab
new_norep_grounding_multi3drefer_train.json	205 MB xet	12 days ago	c9dba5fe
new_norep_grounding_scanrefer_train.json	342 MB xet	12 days ago	9a250c22
new_scanrefer_val.json	113 MB xet	21 days ago	93961845
new_scanrefer_val_iou25.json	113 MB xet	21 days ago	c87db5f4
object_captioning_sceneverse_scannet_train.json	22.2 MB xet	11 days ago	68290b99
object_captioning_sceneverse_scannet_train_filtered.json	291 kB xet	11 days ago	80bf8863
object_captioning_sceneverse_scannet_val.json	4.46 MB xet	21 days ago	6ff48a22
object_features.tar.gz	162 GB xet	21 days ago	950c042b
qa_scanqa_train.json	75.7 MB xet	21 days ago	5fe4c9ae
qa_scanqa_val.json	29.9 MB xet	21 days ago	1b3f0de5
qa_sqa3d_train.json	38.6 MB xet	21 days ago	ac5b6c45
qa_sqa3d_val.json	4.75 MB xet	21 days ago	e8951491
qwen25vl7b_global_local_stage1_feat4_1_b64.tar.gz	3.77 GB xet	21 days ago	a7e37ad5
qwen25vl7b_global_local_stage1_feat8_2.tar.gz	470 MB xet	21 days ago	8482dd17
qwen25vl7b_global_local_stage1_feat8_2_b64.tar.gz	3.78 GB xet	21 days ago	449391c0
qwen25vl7b_stage2_stage2_feat4_1.tar.gz	63.2 GB xet	21 days ago	c2fe72b3
scannet_gt_masks_sonata_feat.tar.gz	80 GB xet	21 days ago	2d9fd44f
scannet_scene_masks.tar.gz	26.9 MB xet	21 days ago	88d7f0f9
scannet_sonata_feat.tar.gz	208 GB xet	21 days ago	ada03203
scannetpp_centers.tar.gz	218 kB xet	21 days ago	df26d797
scanrefer_gt_masks.tar.gz	38.7 MB xet	21 days ago	a54bca18
scanrefer_val.json	111 MB xet	21 days ago	275cf97a
scanrefer_val_gt.json	44.2 MB xet	21 days ago	2bbc3a2b
scanrefer_val_iou25.json	111 MB xet	21 days ago	61c43ef4
scanrefer_val_samelabel.json	10 MB xet	21 days ago	158a6644
scene_captioning_3rscan_train.json	20.2 MB xet	21 days ago	ec7dfafd
scene_captioning_scannet_train.json	4.22 MB xet	11 days ago	cc06f355
scene_captioning_scannet_val.json	896 kB xet	21 days ago	2c9d8904
scene_mask.tar.gz	10.9 MB xet	21 days ago	550d05de
sonata_feat.tar.gz	48.1 GB xet	21 days ago	3c65c920
stage2_mixed_train.json	708 MB xet	21 days ago	6f33c011
stage2_voxel_1_5.tar.gz	137 GB xet	21 days ago	10a4eee0
train_mask.tar.gz	218 kB xet	21 days ago	df26d797

README.md

Dataset Format

Each JSON file contains a list of data samples. Every sample uses the global/local token format, where <global> represents the broader context (a scene or an object) and <local> represents a specific part within it (an object or a component).

[
    {
        "conversations": [
            {
                "role": "user",
                "content": "Looking at the scene <global>, explain the appearance of the highlighted object <local> and where it is located."
            },
            {
                "role": "assistant",
                "content": "This object is a tall, narrow cabinet with a light wood-grain finish ..."
            }
        ],
        "global": [
            {
                "id": "036bce3393",
                "feat_path": "data/sonata_feat/036bce3393_down.npz",
                "sample_mask_path": "data/scene_mask/036bce3393_mask_32768.npy"
            }
        ],
        "local": [
            {
                "global_id": "036bce3393",
                "mask_path": "data/train_mask/036bce3393/036bce3393_part_1.npy"
            }
        ],
        "metadata": {
            "tasks": "dense_description",
            "level": "scene-object"
        }
    }
]
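As a minimal sketch of reading a file in this layout, the snippet below loads one JSON file and summarizes a sample. The helper names `load_samples` and `summarize` are hypothetical and not part of the dataset; the code only assumes the top-level list and the four keys shown in the example above.

```python
import json
from pathlib import Path


def load_samples(path):
    """Read one dataset JSON file and return its top-level list of samples."""
    with Path(path).open("r", encoding="utf-8") as f:
        samples = json.load(f)
    if not isinstance(samples, list):
        raise ValueError(
            f"expected a JSON list at top level, got {type(samples).__name__}"
        )
    return samples


def summarize(sample):
    """Return simple counts for one sample (hypothetical helper)."""
    return {
        "turns": len(sample["conversations"]),
        "globals": len(sample["global"]),
        "locals": len(sample["local"]),
        "level": sample["metadata"]["level"],
    }
```

For the larger files in this bucket (several are above 1 GB), `json.load` reads the whole list into memory at once, so a streaming parser may be a better fit there.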

Fields

`conversations`

A list of message objects representing the dialogue. The model is trained to predict all assistant turns.

role: Either "user" or "assistant".
content: The message text. User messages contain <global> and <local> placeholders that will be replaced with point cloud embeddings during processing.
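To illustrate how the placeholders could be located before embeddings are substituted, here is a rough sketch that splits each message into text pieces and numbered placeholder slots. The function name `segment_conversation` is hypothetical, and the sketch assumes that placeholder indices run across the whole conversation rather than restarting per message; the actual processing code may differ.

```python
import re

# The capturing group makes re.split keep the placeholders in its output.
_PLACEHOLDER = re.compile(r"(<global>|<local>)")


def segment_conversation(conversations):
    """Split messages into ("text", str) and ("global"/"local", index) parts.

    Assumption: the n-th <global> (or <local>) seen in the conversation
    refers to entry n of the sample's "global" (or "local") list.
    """
    counters = {"global": 0, "local": 0}
    segmented = []
    for msg in conversations:
        parts = []
        for piece in _PLACEHOLDER.split(msg["content"]):
            if piece in ("<global>", "<local>"):
                kind = piece[1:-1]  # strip the angle brackets
                parts.append((kind, counters[kind]))
                counters[kind] += 1
            elif piece:  # skip empty strings produced by adjacent matches
                parts.append(("text", piece))
        segmented.append((msg["role"], parts))
    return segmented
```

A downstream step could then replace each `("global", i)` or `("local", i)` slot with the embedding computed from the corresponding list entry.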

`global`

A list of global point cloud entries. Placeholders map to entries by order of appearance: the first <global> in the conversation refers to the first entry in this list, the second to the second, and so on.

id: A unique identifier for this global point cloud.
feat_path: Path to the .npz feature file containing feat_down (downsampled point features) and inverse (raw-to-downsampled index mapping).
sample_mask_path: Path to a .npy mask file used for sampling the global features. May be empty for object-level globals.
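The following sketch shows one way to read a feature file and expand the downsampled features back to the raw points. It assumes, without confirmation from the dataset, that `feat_down` has shape `(M, C)` and that `inverse` is an integer array of length `N` holding, for each raw point, the row of its downsampled point; the function names are hypothetical.

```python
import numpy as np


def load_global_features(feat_path):
    """Load feat_down and inverse from a .npz feature file."""
    with np.load(feat_path) as data:
        feat_down = data["feat_down"]
        inverse = data["inverse"]
    return feat_down, inverse


def expand_to_raw(feat_down, inverse):
    """Give every raw point the feature of its downsampled point.

    Assumes inverse[i] is the row in feat_down for raw point i.
    """
    inverse = np.asarray(inverse)
    if not np.issubdtype(inverse.dtype, np.integer):
        raise TypeError("inverse is expected to hold integer indices")
    if inverse.size and inverse.max() >= feat_down.shape[0]:
        raise IndexError("inverse refers to a row beyond feat_down")
    return feat_down[inverse]  # shape (N, C) under the stated assumptions
```

If the stored arrays turn out to have a different layout, only `expand_to_raw` needs to change.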

`local`

A list of local (masked) point cloud entries. Placeholders map to entries by order of appearance: the first <local> in the conversation refers to the first entry in this list, the second to the second, and so on.

global_id: References the id of the parent global entry that this local region belongs to.
mask_path: Path to a .npy mask file that selects the specific points within the global point cloud.
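As an illustration of how a local entry relates to its parent, the sketch below looks up the parent global entry by `global_id` and applies a mask to a per-point array. The mask encoding is not documented here, so the code is an assumption: it accepts either a boolean mask with one value per point or an array of integer point indices. The names `find_parent` and `select_points` are hypothetical.

```python
import numpy as np


def find_parent(sample, local_entry):
    """Return the global entry whose id matches local_entry["global_id"]."""
    for entry in sample["global"]:
        if entry["id"] == local_entry["global_id"]:
            return entry
    raise KeyError(f"no global entry with id {local_entry['global_id']!r}")


def select_points(per_point, mask):
    """Select rows of a per-point array using a mask loaded from a .npy file.

    Assumption: a bool mask has one value per point; any other dtype is
    treated as a list of point indices (so a 0/1 integer mask would need
    to be converted with mask.astype(bool) first).
    """
    mask = np.asarray(mask)
    if mask.dtype == np.bool_:
        if mask.shape[0] != per_point.shape[0]:
            raise ValueError("boolean mask length must match the point count")
        return per_point[mask]
    return per_point[mask.astype(np.int64)]
```

In practice the mask would come from `np.load(local_entry["mask_path"])`, and `per_point` would be the coordinates or features of the parent point cloud.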

`metadata`

tasks: The task type (e.g., "dense_description").
level: The spatial hierarchy level, which determines the semantics of <global> and <local>:

Level	`<global>` means	`<local>` means	Example prompt
`scene-object`	A 3D scene	An object in the scene	"Looking at the scene <global>, explain the appearance of the highlighted object <local>."
`scene-subobject`	A 3D scene	A sub-part of an object in the scene	"Describe the selected object <local> in scene <global> and its surroundings in detail."
`object-subobject`	An individual object	A component of that object	"Describe the selected component <local> of the object <global>, including its shape, proportions, material, color, and exact position."

Data Distribution

Level	Train	Test
`scene-object`	26,453 (40.8%)	3,089 (57.0%)
`scene-subobject`	9,046 (13.9%)	1,002 (18.5%)
`object-subobject`	29,415 (45.3%)	1,329 (24.5%)
Total	64,914	5,420
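The percentages in this table follow directly from the counts, and the same tally can be recomputed from a loaded file. The sketch below does both; `level_distribution` is a hypothetical helper that only relies on the `metadata.level` field described above.

```python
from collections import Counter


def level_distribution(samples):
    """Map each level to (count, percent of all samples, one decimal)."""
    counts = Counter(s["metadata"]["level"] for s in samples)
    total = sum(counts.values())
    return {
        level: (n, round(100.0 * n / total, 1)) for level, n in counts.items()
    }


# Train counts copied from the table above.
train_counts = {
    "scene-object": 26453,
    "scene-subobject": 9046,
    "object-subobject": 29415,
}
train_total = sum(train_counts.values())  # 64914
train_shares = {
    level: round(100.0 * n / train_total, 1)
    for level, n in train_counts.items()
}
print(train_total, train_shares)
```

Running `level_distribution` on the list loaded from a train or test file should reproduce the corresponding column if the file matches the table.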

Files

GPT_dataset_qwen25_final_train.json — Training set (Qwen2.5 format)
GPT_dataset_qwen25_final_test.json — Test set (Qwen2.5 format)
GPT_dataset_qwen3vl_final_train.json — Training set (Qwen3-VL format)
GPT_dataset_qwen3vl_final_test.json — Test set (Qwen3-VL format)

Last updated: May 6
