I'm trying to replicate tranining for French documents (I didn't find any models published).
I can't find any files I can use for distillation when finetuning with mmarco, in:
https://github.com/hltcoe/BLADE/blob/3e1721b99e2ce09d92134398f7c42c4ab3fabc69/README.md?plain=1#L88C1-L91C1
--q_distil data/query_vector_distil.json \
--p_distil data/passage_vector_distil.jsonl \
--p_offset data/passage_vector_distil.offset
How can I find or produce these files?
Thanks!!
I'm trying to replicate tranining for French documents (I didn't find any models published).
I can't find any files I can use for distillation when finetuning with mmarco, in:
https://github.com/hltcoe/BLADE/blob/3e1721b99e2ce09d92134398f7c42c4ab3fabc69/README.md?plain=1#L88C1-L91C1
How can I find or produce these files?
Thanks!!