--- a/www/HACKING.rst Fri Jul 21 23:19:15 2023 +0300
+++ b/www/HACKING.rst Fri Jul 21 23:23:31 2023 +0300
@@ -325,7 +325,7 @@
* deviation of word frequency across documents in corpus, usually marked by
``D``
-Sorting numerically on first= column::
+Sorting numerically on first column::
$ sort -k 1nr,2 <$IN >$OUT
@@ -360,9 +360,10 @@
http://www.anc.org/data/oanc/download/
OANC download page.
-
http://www.anc.org/data/oanc/
OANC home page.
+https://anc.org/data/anc-second-release/frequency-data/
+ Second release of the ANC.
https://en.wikipedia.org/wiki/Word_lists_by_frequency
@@ -396,6 +397,8 @@
General Service List at Wikipedia.
http://jbauman.com/aboutgsl.html
About the General Service List by John Bauman.
+https://www.eapfoundation.com/vocab/general/gsl/
+ Sheldon Smith on the GSL.
New General Service List
------------------------
@@ -510,7 +513,7 @@
BNC+COCA wordlist
-----------------
-Paul Nation prepare frequency wordlist from combined BNC and COCA corpus:
+Paul Nation prepared a frequency wordlist from the combined BNC and COCA corpora:
http://www.victoria.ac.nz/lals/about/staff/paul-nation
Paul Nation's home page and list download page.
@@ -524,6 +527,10 @@
http://www.laurenceanthony.net/software/antwordprofiler/
Laurence Anthony's AntWordProfiler home page.
+https://www.laurenceanthony.net/resources/wordlists/bnc_coca_cleaned_ver_002_20141015.zip
+ Direct download link with 25k words + extra (dated 2014).
+https://www.wgtn.ac.nz/lals/resources/paul-nations-resources/vocabulary-lists
+ Paul Nation's page at Victoria University with the wordlist download (first 10k).
Oxford 3000/5000
----------------