PyTerrier demonstration for msmarcov2_passage

This notebook demonstrates retrieval using PyTerrier on the MSMARCO v2 Passage Ranking corpus.

About the corpus: A revised corpus of 138M passages released by Microsoft in July 2021, which should be ranked based on their relevance to questions. Also used by the TREC 2021 Deep Learning track.

In [1]:
#!pip install -q python-terrier
import pyterrier as pt
if not pt.started():
    pt.init()

from pyterrier.measures import *
dataset = pt.get_dataset('msmarcov2_passage')
        
PyTerrier 0.7.0 has loaded Terrier 5.6 (built by craigmacdonald on 2021-09-17 13:27)

Systems using index variant terrier_stemmed

Terrier's default Porter stemming, and stopwords removed.

In [2]:
#!pip install git+https://github.com/Georgetown-IR-Lab/OpenNIR.git
import onir_pt
# Lets use a Vanilla BERT ranker from OpenNIR. We'll use the Capreolus model available from Huggingface
vanilla_bert = onir_pt.reranker('hgf4_joint', ranker_config={'model': 'Capreolus/bert-base-msmarco', 'norm': 'softmax-2'})
Better speed can be achieved with apex installed from https://www.github.com/nvidia/apex.
In [3]:
bm25_terrier_stemmed = pt.BatchRetrieve.from_dataset('msmarcov2_passage', 'terrier_stemmed', wmodel='BM25')

bm25_bert_terrier_stemmed = (
    pt.BatchRetrieve.from_dataset('msmarcov2_passage', 'terrier_stemmed') 
    >> pt.text.get_text(pt.get_dataset('irds:msmarco-passage-v2'), 'text') 
    >> onir_pt.reranker('hgf4_joint', ranker_config={'model': 'Capreolus/bert-base-msmarco', 'norm': 'softmax-2'}))

dph_terrier_stemmed = pt.BatchRetrieve.from_dataset('msmarcov2_passage', 'terrier_stemmed', wmodel='DPH')

dph_bert_terrier_stemmed = (
    pt.BatchRetrieve.from_dataset('msmarcov2_passage', 'terrier_stemmed',  wmodel='DPH') 
    >> pt.text.get_text(pt.get_dataset('irds:msmarco-passage-v2'), 'text') 
    >> onir_pt.reranker('hgf4_joint', ranker_config={'model': 'Capreolus/bert-base-msmarco', 'norm': 'softmax-2'}))
21:30:20.154 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading lookup file directly from disk (SLOW) - try index.meta.index-source=fileinmem in the index properties file. 1 GiB of memory would be required.
21:30:20.208 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading data file directly from disk (SLOW) - try index.meta.data-source=fileinmem in the index properties file. 8.6 GiB of memory would be required.
21:30:34.314 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading lookup file directly from disk (SLOW) - try index.meta.index-source=fileinmem in the index properties file. 1 GiB of memory would be required.
21:30:34.333 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading data file directly from disk (SLOW) - try index.meta.data-source=fileinmem in the index properties file. 8.6 GiB of memory would be required.
21:30:54.725 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading lookup file directly from disk (SLOW) - try index.meta.index-source=fileinmem in the index properties file. 1 GiB of memory would be required.
21:30:54.744 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading data file directly from disk (SLOW) - try index.meta.data-source=fileinmem in the index properties file. 8.6 GiB of memory would be required.
21:31:08.351 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading lookup file directly from disk (SLOW) - try index.meta.index-source=fileinmem in the index properties file. 1 GiB of memory would be required.
21:31:08.369 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading data file directly from disk (SLOW) - try index.meta.data-source=fileinmem in the index properties file. 8.6 GiB of memory would be required.

Systems using index variant terrier_unstemmed

Terrier index, no stemming, no stopword removal.

In [4]:
#!pip install git+https://github.com/Georgetown-IR-Lab/OpenNIR.git
import onir_pt
# Lets use a Vanilla BERT ranker from OpenNIR. We'll use the Capreolus model available from Huggingface
vanilla_bert = onir_pt.reranker('hgf4_joint', ranker_config={'model': 'Capreolus/bert-base-msmarco', 'norm': 'softmax-2'})
In [5]:
bm25_terrier_unstemmed = pt.BatchRetrieve.from_dataset('msmarcov2_passage', 'terrier_unstemmed', wmodel='BM25')

bm25_bert_terrier_unstemmed = (
    pt.BatchRetrieve.from_dataset('msmarcov2_passage', 'terrier_unstemmed') 
    >> pt.text.get_text(pt.get_dataset('irds:msmarco-passage-v2'), 'text') 
    >> onir_pt.reranker('hgf4_joint', ranker_config={'model': 'Capreolus/bert-base-msmarco', 'norm': 'softmax-2'}))

dph_terrier_unstemmed = pt.BatchRetrieve.from_dataset('msmarcov2_passage', 'terrier_unstemmed', wmodel='DPH')

dph_bert_terrier_unstemmed = (
    pt.BatchRetrieve.from_dataset('msmarcov2_passage', 'terrier_unstemmed',  wmodel='DPH') 
    >> pt.text.get_text(pt.get_dataset('irds:msmarco-passage-v2'), 'text') 
    >> onir_pt.reranker('hgf4_joint', ranker_config={'model': 'Capreolus/bert-base-msmarco', 'norm': 'softmax-2'}))
21:31:58.787 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading lookup file directly from disk (SLOW) - try index.meta.index-source=fileinmem in the index properties file. 1 GiB of memory would be required.
21:31:58.812 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading data file directly from disk (SLOW) - try index.meta.data-source=fileinmem in the index properties file. 5.7 GiB of memory would be required.
21:32:13.006 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading lookup file directly from disk (SLOW) - try index.meta.index-source=fileinmem in the index properties file. 1 GiB of memory would be required.
21:32:13.029 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading data file directly from disk (SLOW) - try index.meta.data-source=fileinmem in the index properties file. 5.7 GiB of memory would be required.
21:32:34.051 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading lookup file directly from disk (SLOW) - try index.meta.index-source=fileinmem in the index properties file. 1 GiB of memory would be required.
21:32:34.068 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading data file directly from disk (SLOW) - try index.meta.data-source=fileinmem in the index properties file. 5.7 GiB of memory would be required.
21:32:47.585 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading lookup file directly from disk (SLOW) - try index.meta.index-source=fileinmem in the index properties file. 1 GiB of memory would be required.
21:32:47.605 [main] WARN org.terrier.structures.BaseCompressingMetaIndex - Structure meta reading data file directly from disk (SLOW) - try index.meta.data-source=fileinmem in the index properties file. 5.7 GiB of memory would be required.

Evaluation on dev1 topics and qrels

4,552 topics with sparse judgements

In [6]:
pt.Experiment(
    [bm25_terrier_stemmed, bm25_bert_terrier_stemmed, dph_terrier_stemmed, dph_bert_terrier_stemmed, bm25_terrier_unstemmed, bm25_bert_terrier_unstemmed, dph_terrier_unstemmed, dph_bert_terrier_unstemmed],
    pt.get_dataset('msmarcov2_passage').get_topics('dev1'),
    pt.get_dataset('msmarcov2_passage').get_qrels('dev1'),
    batch_size=200,
    filter_by_qrels=True,
    eval_metrics=[RR@10],
    names=['bm25_terrier_stemmed', 'bm25_bert_terrier_stemmed', 'dph_terrier_stemmed', 'dph_bert_terrier_stemmed', 'bm25_terrier_unstemmed', 'bm25_bert_terrier_unstemmed', 'dph_terrier_unstemmed', 'dph_bert_terrier_unstemmed'])
        
21:32:54.584 [main] WARN org.terrier.applications.batchquerying.TRECQuery - trec.encoding is not set; resorting to platform default (ISO-8859-1). Retrieval may be platform dependent. Recommend trec.encoding=UTF-8
config file not found: config
[2021-10-25 22:38:19,897][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-25 22:38:30,939][onir_pt][DEBUG] [starting] batches
                                                             
[2021-10-25 23:08:15,969][onir_pt][DEBUG] [finished] batches: [29:45] [47918it] [26.84it/s]
[2021-10-25 23:10:30,154][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-25 23:10:30,162][onir_pt][DEBUG] [starting] batches
                                                              
[2021-10-25 23:41:06,390][onir_pt][DEBUG] [finished] batches: [30:36] [48614it] [26.47it/s]
[2021-10-25 23:43:42,061][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-25 23:43:42,070][onir_pt][DEBUG] [starting] batches
batches:  72%|███████▏  | 35770/50000 [17:34<06:59, 33.93it/s]
[2021-10-26 01:16:46,549][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 01:16:46,557][onir_pt][DEBUG] [starting] batches
batches:  58%|█████▊    | 29067/49759 [09:19<06:38, 52.03it/s]
[2021-10-26 01:35:07,972][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 01:35:07,980][onir_pt][DEBUG] [starting] batches
                                                            
[2021-10-26 02:24:28,571][onir_pt][DEBUG] [finished] batches: [29:45] [49542it] [27.75it/s]
[2021-10-26 02:27:02,712][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 02:27:02,725][onir_pt][DEBUG] [starting] batches
                                                              
[2021-10-26 02:58:23,507][onir_pt][DEBUG] [finished] batches: [31:21] [50000it] [26.58it/s]
[2021-10-26 03:01:07,379][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 03:01:07,389][onir_pt][DEBUG] [starting] batches
                                                              
[2021-10-26 05:12:16,177][onir_pt][DEBUG] [finished] batches: [29:39] [48865it] [27.47it/s]
[2021-10-26 05:14:25,056][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 05:14:25,063][onir_pt][DEBUG] [starting] batches
batches:  53%|█████▎    | 26374/49328 [15:59<13:55, 27.50it/s]
[2021-10-26 06:18:45,483][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 06:18:45,493][onir_pt][DEBUG] [starting] batches
batches:  86%|████████▌ | 42361/49519 [26:18<04:27, 26.85it/s]
[2021-10-26 07:25:07,162][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 07:25:07,169][onir_pt][DEBUG] [starting] batches
batches:  83%|████████▎ | 20575/24807 [12:59<02:40, 26.40it/s]
[2021-10-26 09:34:03,272][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 09:34:03,422][onir_pt][DEBUG] [starting] batches
batches:  82%|████████▏ | 39903/48614 [24:23<05:19, 27.27it/s]
[2021-10-26 10:37:23,214][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 10:37:23,224][onir_pt][DEBUG] [starting] batches
batches:  86%|████████▌ | 42616/49760 [25:56<04:21, 27.39it/s]
[2021-10-26 11:10:10,826][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 11:10:10,835][onir_pt][DEBUG] [starting] batches
batches:  87%|████████▋ | 43003/49592 [26:24<04:03, 27.14it/s]
[2021-10-26 11:43:01,280][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 11:43:01,288][onir_pt][DEBUG] [starting] batches
batches:  88%|████████▊ | 44116/50000 [27:05<03:37, 27.14it/s]
[2021-10-26 12:16:45,320][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 12:16:45,328][onir_pt][DEBUG] [starting] batches
batches:  89%|████████▉ | 44711/50000 [27:13<03:13, 27.38it/s]
[2021-10-26 13:24:09,195][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 13:24:09,204][onir_pt][DEBUG] [starting] batches
batches:  87%|████████▋ | 42940/49542 [26:01<04:00, 27.50it/s]
[2021-10-26 13:56:41,488][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 13:56:41,502][onir_pt][DEBUG] [starting] batches
                                                              
[2021-10-26 17:09:55,196][onir_pt][DEBUG] [finished] batches: [30:11] [48909it] [27.01it/s]
[2021-10-26 17:12:36,475][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 17:12:36,482][onir_pt][DEBUG] [starting] batches
                                                              
[2021-10-26 17:42:44,900][onir_pt][DEBUG] [finished] batches: [30:08] [49328it] [27.28it/s]
[2021-10-26 17:47:05,634][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 17:47:05,643][onir_pt][DEBUG] [starting] batches
                                                            
[2021-10-26 18:16:57,496][onir_pt][DEBUG] [finished] batches: [29:52] [49014it] [27.35it/s]
[2021-10-26 18:22:15,168][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-26 18:22:15,176][onir_pt][DEBUG] [starting] batches
batches:  54%|█████▍    | 13473/24807 [04:35<03:51, 49.05it/s]
[2021-10-27 09:02:42,331][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 09:02:42,461][onir_pt][DEBUG] [starting] batches
                                                             
[2021-10-27 09:18:08,130][onir_pt][DEBUG] [finished] batches: [15:26] [50000it] [54.02it/s]
[2021-10-27 10:08:47,618][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 10:08:47,627][onir_pt][DEBUG] [starting] batches
                                                             
[2021-10-27 10:24:06,537][onir_pt][DEBUG] [finished] batches: [15:19] [50000it] [54.41it/s]
[2021-10-27 11:03:11,014][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 11:03:11,022][onir_pt][DEBUG] [starting] batches
                                                             
[2021-10-27 11:18:41,168][onir_pt][DEBUG] [finished] batches: [15:30] [50000it] [53.76it/s]
[2021-10-27 11:56:33,481][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 11:56:33,490][onir_pt][DEBUG] [starting] batches
batches: 100%|█████████▉| 49871/50000 [15:14<2.36s, 54.58it/s]
[2021-10-27 12:42:00,415][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 12:42:00,424][onir_pt][DEBUG] [starting] batches
batches:  35%|███▍      | 17441/50000 [05:21<10:00, 54.25it/s]
[2021-10-27 13:46:44,397][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 13:46:44,408][onir_pt][DEBUG] [starting] batches
batches:  84%|████████▍ | 42124/49903 [13:06<02:25, 53.59it/s]
[2021-10-27 14:42:05,781][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 14:42:05,795][onir_pt][DEBUG] [starting] batches
batches:  42%|████▏     | 20939/50000 [06:30<09:01, 53.68it/s]
[2021-10-27 15:42:52,179][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 15:42:52,188][onir_pt][DEBUG] [starting] batches
batches:  49%|████▉     | 24443/50000 [07:30<07:51, 54.30it/s]
[2021-10-27 16:39:53,481][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 16:39:53,492][onir_pt][DEBUG] [starting] batches
batches:  49%|████▉     | 24434/50000 [07:33<07:54, 53.97it/s]
[2021-10-27 17:34:20,077][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 17:34:20,085][onir_pt][DEBUG] [starting] batches
batches:  43%|████▎     | 21337/50000 [06:43<09:01, 53.00it/s]
[2021-10-27 18:33:26,188][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 18:33:26,201][onir_pt][DEBUG] [starting] batches
batches:  62%|██████▏   | 31206/50000 [09:45<05:52, 53.37it/s]
[2021-10-27 19:37:38,050][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 19:37:38,059][onir_pt][DEBUG] [starting] batches
batches:  50%|████▉     | 24755/50000 [07:45<07:54, 53.21it/s]
[2021-10-27 20:37:46,752][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 20:37:46,760][onir_pt][DEBUG] [starting] batches
batches:  64%|██████▎   | 31853/50000 [09:38<05:30, 55.06it/s]
[2021-10-27 21:19:14,952][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 21:19:14,960][onir_pt][DEBUG] [starting] batches
                                                              
[2021-10-27 21:34:38,897][onir_pt][DEBUG] [finished] batches: [15:24] [50000it] [54.12it/s]
[2021-10-27 22:15:25,986][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 22:15:25,994][onir_pt][DEBUG] [starting] batches
batches:  62%|██████▏   | 31059/50000 [09:33<05:49, 54.20it/s]
[2021-10-27 23:10:01,753][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-27 23:10:01,763][onir_pt][DEBUG] [starting] batches
batches:  42%|████▏     | 21087/50000 [06:26<08:49, 54.67it/s]
[2021-10-28 00:16:42,803][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-28 00:16:42,812][onir_pt][DEBUG] [starting] batches
batches:  63%|██████▎   | 31734/50000 [09:59<05:45, 52.95it/s]
[2021-10-28 01:13:45,257][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-28 01:13:45,266][onir_pt][DEBUG] [starting] batches
batches:  46%|████▌     | 22903/50000 [07:01<08:19, 54.35it/s]
[2021-10-28 02:24:39,939][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-28 02:24:39,949][onir_pt][DEBUG] [starting] batches
batches:  32%|███▏      | 8288/25750 [02:30<05:17, 55.17it/s]
[2021-10-28 16:59:36,275][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-28 16:59:36,366][onir_pt][DEBUG] [starting] batches
                                                             
[2021-10-28 17:14:52,487][onir_pt][DEBUG] [finished] batches: [15:16] [50000it] [54.58it/s]
[2021-10-28 18:05:28,569][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-28 18:05:28,578][onir_pt][DEBUG] [starting] batches
                                                             
[2021-10-28 18:20:48,747][onir_pt][DEBUG] [finished] batches: [15:20] [50000it] [54.34it/s]
[2021-10-28 19:04:18,027][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-28 19:04:18,034][onir_pt][DEBUG] [starting] batches
                                                             
[2021-10-28 19:19:51,938][onir_pt][DEBUG] [finished] batches: [15:34] [50000it] [53.54it/s]
[2021-10-28 20:02:10,821][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-28 20:02:10,831][onir_pt][DEBUG] [starting] batches
                                                             
[2021-10-28 20:17:40,187][onir_pt][DEBUG] [finished] batches: [15:29] [50000it] [53.80it/s]
[2021-10-28 20:49:24,545][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-28 20:49:24,555][onir_pt][DEBUG] [starting] batches
batches:  64%|██████▍   | 32006/50000 [09:57<05:35, 53.64it/s]
[2021-10-28 21:54:34,062][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-28 21:54:34,072][onir_pt][DEBUG] [starting] batches
batches:  70%|███████   | 35036/49903 [11:00<04:40, 53.05it/s]
[2021-10-28 22:48:38,466][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-28 22:48:38,475][onir_pt][DEBUG] [starting] batches
batches:  55%|█████▍    | 27339/50000 [08:31<07:04, 53.46it/s]
[2021-10-28 23:47:56,782][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-28 23:47:56,791][onir_pt][DEBUG] [starting] batches
batches:  55%|█████▌    | 27586/50000 [08:31<06:55, 53.99it/s]
[2021-10-29 00:43:33,088][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-29 00:43:33,098][onir_pt][DEBUG] [starting] batches
batches:  55%|█████▌    | 27715/50000 [08:30<06:50, 54.34it/s]
[2021-10-29 01:34:58,282][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-29 01:34:58,290][onir_pt][DEBUG] [starting] batches
batches:  43%|████▎     | 21332/50000 [06:42<09:01, 53.02it/s]
[2021-10-29 02:31:13,576][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-29 02:31:13,585][onir_pt][DEBUG] [starting] batches
batches:  55%|█████▌    | 27676/50000 [08:39<06:59, 53.30it/s]
[2021-10-29 03:32:41,936][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-29 03:32:41,944][onir_pt][DEBUG] [starting] batches
batches:  56%|█████▌    | 27825/50000 [08:44<06:58, 53.06it/s]
[2021-10-29 04:36:49,761][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-29 04:36:49,770][onir_pt][DEBUG] [starting] batches
batches:  64%|██████▎   | 31861/50000 [09:51<05:37, 53.87it/s]
[2021-10-29 05:20:57,878][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-29 05:20:57,887][onir_pt][DEBUG] [starting] batches
batches:  77%|███████▋  | 38336/50000 [11:50<03:36, 54.00it/s]
[2021-10-29 06:15:23,882][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-29 06:15:23,893][onir_pt][DEBUG] [starting] batches
batches:  72%|███████▏  | 35915/50000 [11:02<04:20, 54.24it/s]
[2021-10-29 07:08:18,362][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-29 07:08:18,373][onir_pt][DEBUG] [starting] batches
batches:  46%|████▌     | 22774/50000 [06:56<08:18, 54.70it/s]
[2021-10-29 08:12:30,959][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-29 08:12:30,969][onir_pt][DEBUG] [starting] batches
batches:  63%|██████▎   | 31558/50000 [09:58<05:49, 52.80it/s]
[2021-10-29 09:07:56,318][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-29 09:07:56,330][onir_pt][DEBUG] [starting] batches
batches:  46%|████▌     | 22906/50000 [07:06<08:24, 53.78it/s]
[2021-10-29 10:18:30,728][onir_pt][DEBUG] using GPU (deterministic)
[2021-10-29 10:18:30,734][onir_pt][DEBUG] [starting] batches
batches:  75%|███████▌  | 19352/25750 [06:02<01:60, 53.43it/s]