README.md (+11 -3)
@@ -19,9 +19,10 @@ Additionally, we provide a [search toolkit](https://huggingface.co/spaces/zhimin
|[📓 Data Science Notebook](#data-science-notebook)|[💾 Data Storage Optimisation](#data-storage-optimisation)|[💸 Data Stream Processing](#data-stream-processing)|
|[📜 Model, Data & Experiment Management](#model-data-and-experiment-management)|[🔩 Model Storage Optimisation](#model-storage-optimisation)|[🔏 Privacy & Robustness](#privacy-and-robustness)|[🏁 Training Orchestration](#training-orchestration)|
+|[🗂️ Industry-strength Dataset](#industry-strength-dataset)|[🔥 Industry-strength Information Retrieval](#industry-strength-infret)|[🔠 Industry-strength Natural Language Processing](#industry-strength-nlp)|
* [supervision](https://github.com/roboflow/supervision) - Supervision is a Python library designed for efficient computer vision pipeline management, providing tools for annotation, visualization, and monitoring of models.
* [VideoSys](https://github.com/NUS-HPC-AI-Lab/VideoSys) - VideoSys supports many diffusion models with our various acceleration techniques, enabling these models to run faster and consume less memory.
+## Industry Strength Dataset
+* [Dataset Viewer](https://github.com/huggingface/dataset-viewer) - Dataset Viewer is a tool that enables users to interactively explore and analyze datasets by providing functionalities such as pagination, filtering, searching, and basic statistical insights.
+* [DiffusionDB](https://github.com/poloclub/diffusiondb) - DiffusionDB is a large-scale text-to-image prompt gallery dataset based on Stable Diffusion.
+* [PMLB](https://github.com/EpistasisLab/pmlb) - PMLB is a large, curated repository of benchmark datasets for evaluating supervised machine learning algorithms.
+* [SemanticKITTI](https://github.com/PRBonn/semantic-kitti-api) - SemanticKITTI helps developers to navigate, visualize, process, and evaluate results for point clouds and labels from the SemanticKITTI dataset.
+* [UltraFeedback](https://github.com/OpenBMB/UltraFeedback) - UltraFeedback is a large-scale, fine-grained, diverse preference dataset, used for training powerful reward models and critic models.
+
## Industry Strength InfRet
* [AutoRAG](https://github.com/Marker-Inc-Korea/AutoRAG) - AutoRAG is a RAG AutoML tool that automatically finds an optimal RAG pipeline for your data.
* [Cognita](https://github.com/truefoundry/cognita) - Cognita is a RAG framework for building modular and production-ready applications.