Update README.md
Browse files
    	
        README.md
    CHANGED
    
    | @@ -3,4 +3,29 @@ license: mit | |
| 3 | 
             
            datasets:
         | 
| 4 | 
             
            - RUC-DataLab/DataScience-Instruct-500K
         | 
| 5 | 
             
            pipeline_tag: table-question-answering
         | 
| 6 | 
            -
            ---
         | 
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | 
|  | |
| 3 | 
             
            datasets:
         | 
| 4 | 
             
            - RUC-DataLab/DataScience-Instruct-500K
         | 
| 5 | 
             
            pipeline_tag: table-question-answering
         | 
| 6 | 
            +
            ---
         | 
| 7 | 
            +
            <p align="center" width="100%">
         | 
| 8 | 
            +
            <img src="assets/logo.png" alt="DeepAnalyze" style="width: 60%; min-width: 300px; display: block; margin: auto;">
         | 
| 9 | 
            +
            </p>
         | 
| 10 | 
            +
             | 
| 11 | 
            +
            # DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
         | 
| 12 | 
            +
            [](https://arxiv.org/abs/2510.16872)
         | 
| 13 | 
            +
            [](https://github.com/ruc-datalab/DeepAnalyze)
         | 
| 14 | 
            +
            [](https://ruc-deepanalyze.github.io/)
         | 
| 15 | 
            +
            [](https://huggingface.co/RUC-DataLab/DeepAnalyze-8B)
         | 
| 16 | 
            +
            [](https://huggingface.co/datasets/RUC-DataLab/DataScience-Instruct-500K)
         | 
| 17 | 
            +
            
         | 
| 18 | 
            +
             | 
| 19 | 
            +
            > **Authors**: **[Shaolei Zhang](https://zhangshaolei1998.github.io/), [Ju Fan*](http://iir.ruc.edu.cn/~fanj/), [Meihao Fan](https://scholar.google.com/citations?user=9RTm2qoAAAAJ), [Guoliang Li](https://dbgroup.cs.tsinghua.edu.cn/ligl/), [Xiaoyong Du](http://info.ruc.edu.cn/jsky/szdw/ajxjgcx/jsjkxyjsx1/js2/7374b0a3f58045fc9543703ccea2eb9c.htm)**
         | 
| 20 | 
            +
             | 
| 21 | 
            +
             | 
| 22 | 
            +
            **DeepAnalyze** is the first agentic LLM for autonomous data science. It can autonomously complete a wide range of data-centric tasks without human intervention, supporting:
         | 
| 23 | 
            +
            - 🛠 **Entire data science pipeline**: Automatically perform any data science tasks such as data preparation, analysis, modeling, visualization, and report generation.
         | 
| 24 | 
            +
            - 🔍 **Open-ended data research**: Conduct deep research on diverse data sources, including structured data (Databases, CSV, Excel), semi-structured data (JSON, XML, YAML), and unstructured data (TXT, Markdown), and finally produce analyst-grade research reports.
         | 
| 25 | 
            +
            - 📊 **Fully open-source**: The [model](https://huggingface.co/RUC-DataLab/DeepAnalyze-8B), [code](https://github.com/ruc-datalab/DeepAnalyze), [training data](https://huggingface.co/datasets/RUC-DataLab/DataScience-Instruct-500K), and [demo](https://huggingface.co/RUC-DataLab/DeepAnalyze-8B) of DeepAnalyze are all open-sourced, allowing you to deploy or extend your own data analysis assistant.
         | 
| 26 | 
            +
             | 
| 27 | 
            +
            <p align="center" width="100%">
         | 
| 28 | 
            +
            <img src="./assets/deepanalyze.jpg" alt="deepanalyze" style="width: 70%; min-width: 300px; display: block; margin: auto;">
         | 
| 29 | 
            +
            </p>
         | 
| 30 | 
            +
             | 
| 31 | 
            +
            More information refer to [DeepAnalyze's Repo](https://github.com/ruc-datalab/DeepAnalyze)
         | 
