Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

2vec saveload fixes #11

Merged
merged 40 commits into from
Sep 8, 2020
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
40 commits
Select commit Hold shift + click to select a range
03c8bb9
Make docs clearer on `alpha` parameter in LDA model
xh2 Jul 24, 2020
7791b74
Merge pull request #1 from xh2/patch-1
xh2 Jul 24, 2020
4e1b09c
Update Hoffman paper link
xh2 Jul 24, 2020
25005c5
rm whitespace
gojomo Jul 26, 2020
f34956c
Update gensim/models/ldamodel.py
piskvorky Jul 26, 2020
7d0ef9e
Update gensim/models/ldamodel.py
piskvorky Jul 26, 2020
a662e8d
Merge pull request #2896 from xh2/bugfix/lda-doc-alpha
piskvorky Jul 26, 2020
78778a9
Update gensim/models/ldamodel.py
piskvorky Jul 26, 2020
344c4ab
Merge pull request #2897 from xh2/bugfix/hoffman-paper-link
piskvorky Jul 26, 2020
b70c826
re-applying changes from #2821
piskvorky Jul 26, 2020
a81e547
migrating + regenerating changed docs
piskvorky Jul 26, 2020
78fe1c4
fix forgotten iteritems
piskvorky Jul 26, 2020
a0e40ca
remove extra `model.wv`
piskvorky Jul 26, 2020
4cf4da0
split overlong doc line
piskvorky Jul 26, 2020
161ad55
get rid of six in doc2vec
piskvorky Jul 27, 2020
31d2b87
increase test timeout for Visdom server
piskvorky Jul 27, 2020
bc95bcb
add 32/64 bits report
gojomo Jul 29, 2020
c834e06
add deprecations for init_sims()
piskvorky Jul 30, 2020
172e37f
remove vectors_norm + add link to migration guide to deprecation warn…
piskvorky Jul 30, 2020
3919b68
rename vectors_norm everywhere, update tests, regen docs
piskvorky Jul 30, 2020
d40f685
put back no-op property setter of deprecated vectors_norm
piskvorky Jul 30, 2020
872c8ed
fix typo
piskvorky Jul 30, 2020
4c1b3f7
fix flake8
piskvorky Jul 30, 2020
b39eec2
disable Keras tests
piskvorky Jul 30, 2020
d5556ea
Merge pull request #2899 from RaRe-Technologies/pr2821
piskvorky Jul 30, 2020
f2fd045
test showing FT failure as W2V
gojomo Jul 22, 2020
7ab1501
set .vectors even when ngrams off
gojomo Jul 22, 2020
ce16168
Update gensim/test/test_fasttext.py
piskvorky Jul 26, 2020
779fe46
Update gensim/test/test_fasttext.py
piskvorky Jul 26, 2020
9289c3b
refresh docs for run_annoy tutorial
piskvorky Aug 3, 2020
4b7e372
Merge pull request #2910 from RaRe-Technologies/rerun_tutorial
piskvorky Aug 3, 2020
b308883
Reduce memory use of the term similarity matrix constructor, deprecat…
Witiko Aug 7, 2020
28a2110
Fix doc2vec crash for large sets of doc-vectors (#2907)
gojomo Aug 17, 2020
817cac9
Fix AttributeError in WikiCorpus (#2901)
jenishah Aug 17, 2020
fc4b97f
intensify cbow+hs tests; bulk testing method
gojomo Sep 2, 2020
030e650
use increment operator
gojomo Sep 2, 2020
6e0d00b
Change num_words to topn in dtm_coherence (#2926)
MeganStodel Sep 3, 2020
d524fa4
Merge branch 'develop' into 2vec_saveload_fixes
piskvorky Sep 7, 2020
49b35b7
docstirng fixes
piskvorky Sep 7, 2020
3f972a6
get rid of python2 constructs
piskvorky Sep 8, 2020
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions ISSUE_TEMPLATE.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,6 +22,7 @@ Please provide the output of:
```python
import platform; print(platform.platform())
import sys; print("Python", sys.version)
import struct; print("Bits", 8 * struct.calcsize("P"))
import numpy; print("NumPy", numpy.__version__)
import scipy; print("SciPy", scipy.__version__)
import gensim; print("gensim", gensim.__version__)
Expand Down
4 changes: 2 additions & 2 deletions docs/notebooks/soft_cosine_tutorial.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -225,7 +225,7 @@
"name": "stdout",
"output_type": "stream",
"text": [
"Number of documents: 3\n",
"Number of documents: 2274338\n",
"CPU times: user 2min 1s, sys: 1.9 s, total: 2min 3s\n",
"Wall time: 2min 56s\n"
]
Expand Down Expand Up @@ -259,7 +259,7 @@
" [preprocess(relcomment[\"RelCText\"]) for relcomment in thread[\"RelComments\"]])\n",
" for thread in api.load(\"semeval-2016-2017-task3-subtaskA-unannotated\")]))\n",
"\n",
"print(\"Number of documents: %d\" % len(documents))"
"print(\"Number of documents: %d\" % len(corpus))"
]
},
{
Expand Down
6 changes: 3 additions & 3 deletions docs/src/_matutils.rst
Original file line number Diff line number Diff line change
@@ -1,8 +1,8 @@
:mod:`_matutils` -- Cython matutils
===================================
:mod:`_matutils` -- Compiled extension for math utils
=====================================================

.. automodule:: gensim._matutils
:synopsis: Cython math utils
:synopsis: Compiled extension for math utils
:members:
:inherited-members:
:undoc-members:
Expand Down
5 changes: 3 additions & 2 deletions docs/src/apiref.rst
Original file line number Diff line number Diff line change
Expand Up @@ -50,6 +50,7 @@ Modules:
models/_fasttext_bin
models/phrases
models/poincare
viz/poincare
models/coherencemodel
models/basemodel
models/callbacks
Expand All @@ -63,7 +64,8 @@ Modules:
models/wrappers/varembed
similarities/docsim
similarities/termsim
similarities/index
similarities/annoy
similarities/nmslib
sklearn_api/atmodel
sklearn_api/d2vmodel
sklearn_api/hdp
Expand Down Expand Up @@ -102,4 +104,3 @@ Modules:
summarization/summariser
summarization/syntactic_unit
summarization/textcleaner
viz/poincare
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
2 changes: 1 addition & 1 deletion docs/src/auto_examples/core/run_similarity_queries.py.md5
Original file line number Diff line number Diff line change
@@ -1 +1 @@
a3eaf7347874a32d1d25a455753206dc
54804120deb345715247f0eed42b5e0e
28 changes: 14 additions & 14 deletions docs/src/auto_examples/core/run_similarity_queries.rst
Original file line number Diff line number Diff line change
Expand Up @@ -142,7 +142,7 @@ no random-walk static ranks, just a semantic extension over the boolean keyword

.. code-block:: none

[(0, 0.4618210045327158), (1, 0.07002766527900064)]
[(0, 0.46182100453271613), (1, 0.07002766527900031)]



Expand Down Expand Up @@ -254,15 +254,15 @@ order, and obtain the final answer to the query `"Human computer interaction"`:

.. code-block:: none

(2, 0.9984453) Human machine interface for lab abc computer applications
(0, 0.998093) A survey of user opinion of computer system response time
(3, 0.9865886) The EPS user interface management system
(1, 0.93748635) System and human system engineering testing of EPS
(4, 0.90755945) Relation of user perceived response time to error measurement
(8, 0.050041765) The generation of random binary unordered trees
(7, -0.09879464) The intersection graph of paths in trees
(6, -0.10639259) Graph minors IV Widths of trees and well quasi ordering
(5, -0.12416792) Graph minors A survey
0.9984453 The EPS user interface management system
0.998093 Human machine interface for lab abc computer applications
0.9865886 System and human system engineering testing of EPS
0.93748635 A survey of user opinion of computer system response time
0.90755945 Relation of user perceived response time to error measurement
0.050041765 Graph minors A survey
-0.09879464 Graph minors IV Widths of trees and well quasi ordering
-0.10639259 The intersection graph of paths in trees
-0.12416792 The generation of random binary unordered trees



Expand Down Expand Up @@ -319,17 +319,17 @@ on large datasets easily, and to facilitate prototyping of new algorithms for re

.. code-block:: none

/Volumes/work/workspace/gensim_misha/docs/src/gallery/core/run_similarity_queries.py:194: UserWarning: Matplotlib is currently using agg, which is a non-GUI backend, so cannot show the figure.
plt.show()
/Volumes/work/workspace/vew/gensim3.6/lib/python3.6/site-packages/matplotlib/figure.py:445: UserWarning: Matplotlib is currently using agg, which is a non-GUI backend, so cannot show the figure.
% get_backend())




.. rst-class:: sphx-glr-timing

**Total running time of the script:** ( 0 minutes 0.663 seconds)
**Total running time of the script:** ( 0 minutes 1.211 seconds)

**Estimated memory usage:** 6 MB
**Estimated memory usage:** 39 MB


.. _sphx_glr_download_auto_examples_core_run_similarity_queries.py:
Expand Down
6 changes: 3 additions & 3 deletions docs/src/auto_examples/core/sg_execution_times.rst
Original file line number Diff line number Diff line change
Expand Up @@ -5,9 +5,9 @@

Computation times
=================
**00:00.844** total execution time for **auto_examples_core** files:
**00:01.211** total execution time for **auto_examples_core** files:

- **00:00.844**: :ref:`sphx_glr_auto_examples_core_run_topics_and_transformations.py` (``run_topics_and_transformations.py``)
- **00:01.211**: :ref:`sphx_glr_auto_examples_core_run_similarity_queries.py` (``run_similarity_queries.py``)
- **00:00.000**: :ref:`sphx_glr_auto_examples_core_run_core_concepts.py` (``run_core_concepts.py``)
- **00:00.000**: :ref:`sphx_glr_auto_examples_core_run_corpora_and_vector_spaces.py` (``run_corpora_and_vector_spaces.py``)
- **00:00.000**: :ref:`sphx_glr_auto_examples_core_run_similarity_queries.py` (``run_similarity_queries.py``)
- **00:00.000**: :ref:`sphx_glr_auto_examples_core_run_topics_and_transformations.py` (``run_topics_and_transformations.py``)
Loading