Robust Ranking

Installation:

Run pip install -r requirements.txt

Training/ Evaluation

Training Bi-Encoder

Run the script generic_training.py for training based on the AIDA, lcquad and mintaka data Run the script training_msmarco.py fro training on MSMARCO

Training Cross Encoder

Run the script train_cross_encoder.py to train a cross-encoder training on MSMARCO

Configuration

for further settings see parameters.py.

The evaluation scores are computed on the fly during training

Evaluation MS MARCO

Use the scrip eval_ms_marco_model.py

Noise

the implementation for the noise can be seen in the file optimizers/noise.py

for loading the data the according datasets have to be downloaded from the according repository lcquad2:https://github.com/AskNowQA/LC-QuAD2.0 mintaka: https://github.com/amazon-science/mintaka for aida the files has to be in nif format:https://github.com/dice-group/gerbil, and for msmarco dataset:https://github.com/microsoft/MSMARCO-Document-Ranking For preprocessing the MS MARCO dataset the script msmarco_preprocessing provides some code to generate th e required dictionaries and other files, used during training. a more detailed description will be published with the repository.

Name		Name	Last commit message	Last commit date
Latest commit History 173 Commits
cross_encoder_data		cross_encoder_data
models		models
nif		nif
optimizers		optimizers
Evaluator.py		Evaluator.py
MS_marco_collator.py		MS_marco_collator.py
README.md		README.md
Trainer.py		Trainer.py
collator.py		collator.py
data_processing.py		data_processing.py
eval_existing.py		eval_existing.py
eval_ms_marco_model.py		eval_ms_marco_model.py
extract_relations.py		extract_relations.py
extract_script.py		extract_script.py
faiss-hswf-index-test		faiss-hswf-index-test
generic_training.py		generic_training.py
indexing.py		indexing.py
ms_marco_data_handler.py		ms_marco_data_handler.py
msmarco_preprocessing.py		msmarco_preprocessing.py
parameters.py		parameters.py
requirements.txt		requirements.txt
train_augmentation.py		train_augmentation.py
train_cross_encoder.py		train_cross_encoder.py
train_relation_augmentation.py		train_relation_augmentation.py
training_msmarco.py		training_msmarco.py
update_labels.py		update_labels.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Robust Ranking

Installation:

Training/ Evaluation

Training Bi-Encoder

Training Cross Encoder

Configuration

Evaluation MS MARCO

Noise

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Robust Ranking

Installation:

Training/ Evaluation

Training Bi-Encoder

Training Cross Encoder

Configuration

Evaluation MS MARCO

Noise

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages