Skip to content

Commit 7d8dafa

Browse files
committed
Updated the explnation and modified import
1 parent 776f101 commit 7d8dafa

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

9 Lexical Dispersion Plot.ipynb

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -94,7 +94,7 @@
9494
"## Explanation \n",
9595
"\n",
9696
"### Tokenisation\n",
97-
"Firstly, we check which language the function is present in. Then we try to sort them accordingly, sending the Indian ones one way, and English and Latin the other. Both these groups have been assigned their own separate tokenizer. We use the CLTK Indian tokenizer for Indian languages and the NLTK `word_tokenize` method for the other two languages.\n",
97+
"Firstly, we check which language the function is present in. Then we try to sort them accordingly, sending the Indian ones one way, and English and Latin the other. Both these groups have been assigned their own separate tokenizer. We use the CLTK `TokenizeSentence()` for Indian languages and the NLTK `word_tokenize` method for the other two languages.\n",
9898
"\n",
9999
"### Locating Matches and Plotting\n",
100100
"This is a pretty straightforward task where we select matches from the text and store their positions in the text in order to display them on the graph. This is achieved using simple loops. It is followed by basic plotting and manipulating data points to produce the lexical dispersion plot."

0 commit comments

Comments
 (0)