# data-release
**Repository Path**: pxd1998/data-release
## Basic Information
- **Project Name**: data-release
- **Description**: No description available
- **Primary Language**: Unknown
- **License**: Not specified
- **Default Branch**: main
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2026-04-20
- **Last Updated**: 2026-04-20
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
Huawei Cloud Production FaaS Trace Data Releases
This repository contains public releases of Huawei Cloud production serverless FaaS traces made available to the research and academic community by Huawei's Systems Infrastructure Research (SIR) lab in Edinburgh, UK.
These public traces are hashed versions of our raw production logs of tens of billions of user requests over multiple data centers and many months. We release them to enable researchers to conduct realistic simulations and train machine learning models to improve scheduling and resource allocation in cloud platforms.
The data is analyzed in two papers:
* EuroSys 2025: ***Serverless Cold Starts and Where to Find Them***
* Links to our paper PDF: [](https://arxiv.org/abs/2410.06145) [](https://dl.acm.org/doi/10.1145/3689031.3696073) [](https://github.com/sir-lab/data-release/blob/edits/papers/Serverless_Cold_Starts_and_Where_to_Find_Them_EuroSys_2025.pdf)
* Conference video presentation: [](https://youtu.be/AAy3e3FXlRo?si=ywH882aH3enC54pv) [](https://bili.im/8YGufiJ)
* ACM SoCC 2023: ***How Does It Function? Characterizing Long-term Trends in Production Serverless Workloads***
* Links to our paper PDF: [](https://arxiv.org/abs/2312.10127) [](https://dl.acm.org/doi/10.1145/3620678.3624783) [](https://github.com/sir-lab/data-release/blob/main/papers/SoCC_2023_How_does_it_function.pdf)
* Conference video presentation: [](https://www.youtube.com/watch?v=fNhd7vIJgRc) [](https://www.bilibili.tv/en/video/4789602907980800)
## How to download the data
The datasets used in our papers can be downloaded at the links below.
* Huawei Public cold start traces 2025 contains 85 billion raw user requests and 11.9 cold start events in 5 regions. There are 19 metrics per function over 31 days, as well as aggregated time series formats for convenience.
* Huawei Public and Public traces 2023 contains 1.4 trillion function requests in time series format. There are 8 metrics per function over 235 days at per-minute and per-second granularity for two serverless platforms.
In some cases, you may not need all files in a zip folder. You can use 7zip to drag and drop the desired files or directories without extracting the entire archive.
## Code
To get started using the datasets, look at our notebooks for tips on how to load files and visualize the data.
* Huawei Public cold starts 2025 notebook
* Huawei Private 2023 notebook
* Huawei Public 2023 notebook
To run our notebooks with the required packages installed, you can install our conda environment as follows:
```bash
conda env create -f environment.yml
conda activate trace-analysis
```
## Contact us
We welcome feedback, collaboration, or questions. Feel free to open an [Issue](https://github.com/sir-lab/data-release/issues).
These traces and associated research result from a collaboration between the Systems Infrastructure Research (SIR) lab in Edinburgh (part of Huawei Research UK) and Huawei's YuanRong serverless cloud platform team.
Licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.