Merge pull request #204 from e06084/main

[feat]: add dataeval/dingo tool

Merge pull request #204 from e06084/main
512379ca · Hannibal046 · GitHub · b09d2a33 · cd612ce9 · 512379ca
Unverified Commit 512379ca authored 1 month ago by Hannibal046 Committed by GitHub 1 month ago
--- a/README.md
+++ b/README.md
@@ -370,16 +370,20 @@
 </details>
 <details>
 <summary>Shanghai AI Laboratory</summary>
+  
  - [InternLM2-1.8|7|20B](https://huggingface.co/collections/internlm/internlm2-65b0ce04970888799707893c)
  - [InternLM-Math-7B|20B](https://huggingface.co/collections/internlm/internlm2-math-65b0ce88bf7d3327d0a5ad9f)
  - [InternLM-XComposer2-1.8|7B](https://huggingface.co/collections/internlm/internlm-xcomposer2-65b3706bf5d76208998e7477)
  - [InternVL-2|6|14|26](https://huggingface.co/collections/OpenGVLab/internvl-65b92d6be81c86166ca0dde4)
+
+    
 </details>

 ## LLM Data
 > Reference: [LLMDataHub](https://github.com/Zjh-819/LLMDataHub)
 - [IBM data-prep-kit](https://github.com/IBM/data-prep-kit) - Open-Source Toolkit for Efficient Unstructured Data Processing with Pre-built Modules and Local to Cluster Scalability.
 - [Datatrove](https://github.com/huggingface/datatrove) - Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
+- [Dingo](https://github.com/DataEval/dingo) - Dingo: A Comprehensive Data Quality Evaluation Tool

 ## LLM Evaluation:
 - [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) - A framework for few-shot evaluation of language models.