Multi-Conditional Ranking with Large Language Models

This repository is the implementation for the paper "Multi-Conditional Ranking with Large Language Models"

🛠️ Dataset Construction Process

The benchmark evaluates LLMs on a multi-conditional ranking (MCR) task, where the goal is to order items based on various unsorted conditions. Each item is paired with a gold label that identifies its category or feature value, forming the basis for determining the correct ranking. The dataset is organized around two item types—token-level (short text segments) and paragraph-level (up to 150 tokens)—and involves five distinct condition types:

Positional Conditions:
Require placing items in specific positions using nuanced spatial language (e.g., “the last item from the left”), adding complexity beyond basic fixed placements.
Locational Conditions:
Involve ranking items based on geographical attributes, with data drawn from sources such as T-REx and Dice job descriptions.
Temporal Conditions:
Require ranking depend on dates—like birthdates or deadlines—with examples sourced from benchmarks including CACD, Dice, and SQuAD.
Trait-Based Conditions:
Rankings are based on physical features (such as size or height) with items collected from resources like VEC and Amazon reviews.
Reason-Based Conditions:
Demand logical or mathematical reasoning for ranking, using samples from Big-Bench and DROP.

For each condition type, 200 samples were generated, each pairing a condition with a randomly arranged set of items. To mimic realistic scenarios with conflicting priorities, additional conditions are introduced (e.g., a low-priority character count or an extra high-priority positional condition), resulting in 18 unique scenarios (varying by item count and condition combinations).

Upon assigning these priorities, the order of conditions is randomized to further increase the task's complexity, and samples from each condition type are combined to form the final dataset for each scenario. For clarity, samples where multiple items share the same character count are removed, resulting in approximately 1000 curated samples per scenario.

📝 Data Source Attribution

Our benchmarks build upon data derived from several publicly available datasets:

T-REx
- Source: T-REx Website
- License: CC BY-SA 4.0
CACD
- Source: CACD Website
- License: Public Domain
VEC Dataset
- Source: VEC Repository
- License: Public Domain
Big-Bench Dataset
- Source: Big-Bench Repository
- License: Apache-2.0
Dice Job Description Dataset
- Source: Kaggle
- License: CC BY-SA 4.0
SQUAD Dataset
- Source: SQUAD Website
- License: CC BY-SA 4.0
Amazon Reviews Dataset
- Source-1: Amazon Reviews Repository-1
- License: Public Domain
- Source-2: Amazon Reviews Repository-2
- License: Attribution-NonCommercial 4.0 International
Drop Dataset
- Source: Drop Huggingface Page
- License: CC BY-SA 4.0

Please refer to the respective sources for detailed licensing terms.

🧠 Usage Guidelines

Use this dataset for research and educational purposes.
Commercial use may require additional permissions depending on source licenses.

⭐ Citation

If you would like to cite our work, the bibtex is:

@article{pezeshkpour2024multi,
title={Multi-Conditional Ranking with Large Language Models},
author={Pezeshkpour, Pouya and Hruschka, Estevam},
journal={arXiv preprint arXiv:2404.00211},
year={2024}
}

📜 Disclosure

Embedded in, or bundled with, this product are open source software (OSS) components, datasets and other third party components identified below. The license terms respectively governing the datasets and third-party components continue to govern those portions, and you agree to those license terms, which, when applicable, specifically limit any distribution. You may receive a copy of, distribute and/or modify any open source code for the OSS component under the terms of their respective licenses, which may be CC license and Apache 2.0 license. In the event of conflicts between Megagon Labs, Inc., license conditions and the Open Source Software license conditions, the Open Source Software conditions shall prevail with respect to the Open Source Software portions of the software. You agree not to, and are not permitted to, distribute actual datasets used with the OSS components listed below. You agree and are limited to distribute only links to datasets from known sources by listing them in the datasets overview table below. You are permitted to distribute derived datasets of data sets from known sources by including links to original dataset source in the datasets overview table below. You agree that any right to modify datasets originating from parties other than Megagon Labs, Inc. are governed by the respective third party’s license conditions. All OSS components and datasets are distributed WITHOUT ANY WARRANTY, without even implied warranty such as for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE, and without any liability to or claim against any Megagon Labs, Inc. entity other than as explicitly documented in this README document. You agree to cease using any part of the provided materials if you do not agree with the terms or the lack of any warranty herein. While Megagon Labs, Inc., makes commercially reasonable efforts to ensure that citations in this document are complete and accurate, errors may occur. If you see any error or omission, please help us improve this document by sending information to [email protected].

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
data		data
figs		figs
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Conditional Ranking with Large Language Models

🛠️ Dataset Construction Process

📝 Data Source Attribution

🧠 Usage Guidelines

⭐ Citation

📜 Disclosure

About

Releases

Packages

License

megagonlabs/MCR

Folders and files

Latest commit

History

Repository files navigation

Multi-Conditional Ranking with Large Language Models

🛠️ Dataset Construction Process

📝 Data Source Attribution

🧠 Usage Guidelines

⭐ Citation

📜 Disclosure

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages