Spaces:
Running
Running
Commit
Β·
3cf16fb
1
Parent(s):
5cc5417
docs: update README to focus on data engineering tools
Browse files- Update description to emphasize systematic benchmarking for data engineering
- Remove Go-related content (web services, concurrency, HTTP handlers)
- Focus on Python data processing, ETL pipelines, data transformation
- Add documentation generation for technical docs and API documentation
- Align project structure with data engineering focus
README.md
CHANGED
|
@@ -19,12 +19,12 @@ tags:
|
|
| 19 |
|
| 20 |
# DataEngEval
|
| 21 |
|
| 22 |
-
A comprehensive evaluation platform for
|
| 23 |
|
| 24 |
## π Features
|
| 25 |
|
| 26 |
-
- **Multi-use-case evaluation**: SQL generation, Python
|
| 27 |
-
- **Real-world datasets**: NYC Taxi,
|
| 28 |
- **Comprehensive metrics**: Correctness, execution success, syntax validation, performance
|
| 29 |
- **Remote inference**: Uses Hugging Face Inference API (no local model downloads)
|
| 30 |
- **Mock mode**: Works without API keys for demos
|
|
@@ -37,9 +37,12 @@ A comprehensive evaluation platform for AI models across SQL generation and code
|
|
| 37 |
- **Metrics**: Correctness, execution, result matching, dialect compliance
|
| 38 |
|
| 39 |
### Code Generation
|
| 40 |
-
- **Python**:
|
| 41 |
-
- **
|
| 42 |
-
|
|
|
|
|
|
|
|
|
|
| 43 |
|
| 44 |
## ποΈ Project Structure
|
| 45 |
|
|
@@ -59,8 +62,8 @@ dataeng-leaderboard/
|
|
| 59 |
β βββ utils/ # Utility functions
|
| 60 |
βββ tasks/ # Multi-use-case datasets
|
| 61 |
β βββ sql_generation/ # SQL generation tasks
|
| 62 |
-
β βββ code_generation/
|
| 63 |
-
β βββ documentation/ #
|
| 64 |
βββ prompts/ # SQL generation templates
|
| 65 |
βββ test/ # Test files
|
| 66 |
```
|
|
|
|
| 19 |
|
| 20 |
# DataEngEval
|
| 21 |
|
| 22 |
+
A comprehensive evaluation platform for systematically benchmarking performance across various models and programming languages, focusing on data engineering tools and technologies.
|
| 23 |
|
| 24 |
## π Features
|
| 25 |
|
| 26 |
+
- **Multi-use-case evaluation**: SQL generation, Python data processing, documentation generation
|
| 27 |
+
- **Real-world datasets**: NYC Taxi queries, data transformation algorithms, technical documentation
|
| 28 |
- **Comprehensive metrics**: Correctness, execution success, syntax validation, performance
|
| 29 |
- **Remote inference**: Uses Hugging Face Inference API (no local model downloads)
|
| 30 |
- **Mock mode**: Works without API keys for demos
|
|
|
|
| 37 |
- **Metrics**: Correctness, execution, result matching, dialect compliance
|
| 38 |
|
| 39 |
### Code Generation
|
| 40 |
+
- **Python**: Data processing algorithms, ETL pipelines, data transformation functions
|
| 41 |
+
- **Metrics**: Syntax correctness, execution success, data processing accuracy, code quality
|
| 42 |
+
|
| 43 |
+
### Documentation Generation
|
| 44 |
+
- **Technical Documentation**: API documentation, system architecture, data pipeline documentation
|
| 45 |
+
- **Metrics**: Content accuracy, completeness, technical clarity, formatting quality
|
| 46 |
|
| 47 |
## ποΈ Project Structure
|
| 48 |
|
|
|
|
| 62 |
β βββ utils/ # Utility functions
|
| 63 |
βββ tasks/ # Multi-use-case datasets
|
| 64 |
β βββ sql_generation/ # SQL generation tasks
|
| 65 |
+
β βββ code_generation/ # Python data processing tasks
|
| 66 |
+
β βββ documentation/ # Technical documentation tasks
|
| 67 |
βββ prompts/ # SQL generation templates
|
| 68 |
βββ test/ # Test files
|
| 69 |
```
|