# wiki-bpe-48k **Repository Path**: hf-datasets/wiki-bpe-48k ## Basic Information - **Project Name**: wiki-bpe-48k - **Description**: Mirror of https://huggingface.co/datasets/cyrilzhang/wiki-bpe-48k - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2025-06-15 - **Last Updated**: 2025-06-15 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README --- configs: - config_name: default data_files: - split: train path: data/train-* - split: test path: data/test-* dataset_info: features: - name: input_ids sequence: int32 splits: - name: train num_bytes: 20505990100 num_examples: 5001461 - name: test num_bytes: 206143900 num_examples: 50279 download_size: 9547305598 dataset_size: 20712134000 --- # Dataset Card for "wiki-bpe-48k" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)