Revisions of python-nltk
buildservice-autocommit accepted request 1160546 from Daniel Garcia (dgarcia) (revision 46)
baserev update by copy to link target
Daniel Garcia (dgarcia) accepted request 1160467 from Benjamin Greiner (bnavigator) (revision 45)
- Update to 3.8.1
  * Resolve RCE & XSS vulnerabilities in localhost WordNet Browser
  * Add Python 3.11 support
- Update nltk_data archive
- Drop port-2to3.patch
- Add nltk-pr3207-py312.patch for Python 3.12 support
  * gh#nltk/nltk#3207
buildservice-autocommit accepted request 1077159 from Factory Maintainer (factory-maintainer) (revision 44)
baserev update by copy to link target
Matej Cepl (mcepl) accepted request 1074922 from Petr Gajdos (pgajdos) (revision 43)
- python-six is not required
buildservice-autocommit accepted request 1056667 from Markéta Machová (mcalabkova) (revision 42)
baserev update by copy to link target
Markéta Machová (mcalabkova) accepted request 1056422 from Yogalakshmi Arunachalam (yarunachalam) (revision 41)
- Update to 3.8
  * Refactor dispersion plot (#3082)
  * Provide type hints for LazyCorpusLoader variables (#3081)
  * Throw warning when LanguageModel is initialized with incorrect vocabulary (#3080)
  * Fix WordNet's all_synsets() function (#3078)
  * Resolve TreebankWordDetokenizer inconsistency with end-of-string contractions (#3070)
  * Support both iso639-3 codes and BCP-47 language tags (#3060)
  * Avoid DeprecationWarning in Regexp tokenizer (#3055)
  * Fix many doctests, add doctests to CI (#3054, #3050, #3048)
  * Fix bool field not being read in VerbNet (#3044)
  * Greatly improve time efficiency of SyllableTokenizer when tokenizing numbers (#3042)
  * Fix encodings of Polish udhr corpus reader (#3038)
  * Allow TweetTokenizer to tokenize emoji flag sequences (#3034)
  * Prevent LazyModule from increasing the size of nltk.__dict__ (#3033)
  * Fix CoreNLPServer non-default port issue (#3031)
  * Add "acion" suffix to the Spanish SnowballStemmer (#3030)
  * Allow loading WordNet without OMW (#3026)
  * Use input() in nltk.chat.chatbot() for Jupyter support (#3022)
  * Fix edit_distance_align() in distance.py (#3017)
  * Tackle performance and accuracy regression of sentence tokenizer since NLTK 3.6.6 (#3014)
  * Add the Iota operator to semantic logic (#3010)
  * Resolve critical errors in WordNet app (#3008)
  * Resolve critical error in CHILDES Corpus (#2998)
  * Make WordNet information_content() accept adjective satellites (#2995)
  * Add "strict=True" parameter to CoreNLP (#2993, #3043)
  * Resolve issue with WordNet's synset_from_sense_key (#2988)
  * Handle WordNet synsets that were lost in mapping (#2985)
  * Resolve TypeError in Boxer (#2979)
  * Add function to retrieve WordNet synonyms (#2978)
  * Warn about nonexistent OMW offsets instead of raising an error (#2974)
buildservice-autocommit accepted request 1045543 from Matej Cepl (mcepl) (revision 40)
baserev update by copy to link target
Matej Cepl (mcepl) committed (revision 39)
- Clean up the SPEC to get rid of rpmlint warnings.
Matej Cepl (mcepl) committed (revision 38)
- Complete nltk_data.tar.xz for offline testing
- Fix failing tests (gh#nltk/nltk#2969) by adding patches:
  - port-2to3.patch
  - skip-networked-test.patch
buildservice-autocommit accepted request 965220 from Dirk Mueller (dirkmueller) (revision 37)
baserev update by copy to link target
Matej Cepl (mcepl) committed (revision 36)
- Update to 3.7
  - Improve and update the NLTK team page on nltk.org (#2855, #2941)
  - Drop support for Python 3.6, support Python 3.10 (#2920)
- Update to 3.6.7
  - Resolve IndexError in `sent_tokenize` and `word_tokenize` (#2922)
- Update to 3.6.6
  - Refactor `gensim.doctest` to work for gensim 4.0.0 and up (#2914)
  - Add Precision, Recall, F-measure, Confusion Matrix to Taggers (#2862)
  - Add warnings if .zip files exist without any corresponding .csv files (#2908)
  - Fix `FileNotFoundError` when the `download_dir` is a non-existing nested folder (#2910)
  - Rename omw to omw-1.4 (#2907)
  - Resolve ReDoS opportunity by fixing incorrectly specified regex (#2906, bsc#1191030, CVE-2021-3828)
  - Support OMW 1.4 (#2899)
  - Deprecate Tree get and set node methods (#2900)
  - Fix broken inaugural test case (#2903)
  - Use Multilingual Wordnet Data from OMW with newer Wordnet versions (#2889)
  - Keep NLTK's "tokenize" module working with pathlib (#2896)
  - Make prettyprinter more readable (#2893)
  - Update links to the nltk book (#2895)
  - Add `CITATION.cff` to nltk (#2880)
  - Resolve serious ReDoS in PunktSentenceTokenizer (#2869)
  - Delete old CI config files (#2881)
buildservice-autocommit accepted request 812413 from Tomáš Chvátal (scarabeus_iv) (revision 35)
baserev update by copy to link target
Tomáš Chvátal (scarabeus_iv) committed (revision 34)
Tomáš Chvátal (scarabeus_iv) accepted request 812178 from John Vandenberg (jayvdb) (revision 33)
- Update to v3.5
  * add support for Python 3.8
  * drop support for Python 2
  * create NLTK's own Tokenizer class distinct from the Treebank reference tokeniser
  * update Vader sentiment analyser
  * fix JSON serialization of some PoS taggers
  * minor improvements in grammar.CFG, Vader, pl196x corpus reader, StringTokenizer
  * change implementation of <= and >= for FreqDist so they are partial orders
  * make FreqDist iterable
  * correctly handle Penn Treebank trees with an unlabeled branching top node
buildservice-autocommit accepted request 787913 from Dirk Mueller (dirkmueller) (revision 32)
baserev update by copy to link target
Dirk Mueller (dirkmueller) committed (revision 31)
- Update to 3.4.5 (bsc#1146427, CVE-2019-14751):
buildservice-autocommit accepted request 784877 from Tomáš Chvátal (scarabeus_iv) (revision 30)
baserev update by copy to link target
Tomáš Chvátal (scarabeus_iv) committed (revision 29)
Tomáš Chvátal (scarabeus_iv) committed (revision 28)
- Fix build without python2
buildservice-autocommit accepted request 738364 from Matej Cepl (mcepl) (revision 27)
baserev update by copy to link target