README

Changeset 4cc65e232f6a

Parent d830396677ec

by Rup Palchowdhury

@@ -0,0 +1,22 @@
+Tokenize the corpus and query.
+
+    ./raw2t -x -n -c TRECQUERY <q.txt >q.t
+    ./raw2t -x -n -c TREC <d.txt >d.t
+
+Print readable tokenized files (if needed).
+
+    ./t2mem <q.t >q.mem
+    ./t2mem <d.t >d.mem
+
+Build inverted index from tokenized corpus and search (-s) using
+queries.
+
+    ./ii1 -s q.t <d.t >res
+
+Rank the search result.
+
+    sort -k1,1 -k3,3nr res >rank
+
+Convert the result to TREC run format.
+
+    awk -f txt2trecrun.awk <rank >run
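
The last two README steps (ranking with sort, then converting with awk) can be tried on toy data without the repository's tools. The sketch below is only an illustration under stated assumptions: it assumes `res` has the query id in column 1, a document id in column 2, and a numeric score in column 3, which is what the sort keys suggest but the README does not state. The inline awk program is a hypothetical stand-in for `txt2trecrun.awk`, whose contents are not shown here; it writes the conventional six-field TREC run line (query id, `Q0`, document id, rank, score, run tag). The file names `res.sample`, `rank.sample`, `run.sample` and the tag `demo` are made up for the example.

```shell
# Pin the locale so "." is read as the decimal point by sort -n.
export LC_ALL=C

# Hypothetical sample of `res`: query id, document id, score (layout assumed).
cat >res.sample <<'EOF'
q2 d7 0.40
q1 d3 1.25
q1 d9 2.50
q2 d1 0.90
EOF

# Same keys as the README: group by column 1, then order column 3
# numerically, highest score first.
sort -k1,1 -k3,3nr res.sample >rank.sample

# Stand-in for txt2trecrun.awk (an assumption, not the repository's script):
# print "qid Q0 docno rank score tag", restarting the rank at 1 per query.
awk -v tag=demo '
    $1 != prev { rank = 0; prev = $1 }
    { rank++; print $1, "Q0", $2, rank, $3, tag }
' rank.sample >run.sample

cat run.sample
```

With the sample above this prints:

    q1 Q0 d9 1 2.50 demo
    q1 Q0 d3 2 1.25 demo
    q2 Q0 d1 1 0.90 demo
    q2 Q0 d7 2 0.40 demo

If the real `res` uses a different column layout, the sort keys and the awk field references would need to change accordingly.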