Modules:
interfaces
– Core gensim interfaces
utils
– Various utility functions
matutils
– Math utils
_matutils
– Cython matutils
downloader
– Downloader API for gensim
corpora.bleicorpus
– Corpus in Blei’s LDA-C format
corpora.csvcorpus
– Corpus in CSV format
corpora.dictionary
– Construct word<->id mappings
corpora.hashdictionary
– Construct word<->id mappings
corpora.indexedcorpus
– Random access to corpus documents
corpora.lowcorpus
– Corpus in GibbsLda++ format
corpora.malletcorpus
– Corpus in Mallet format
corpora.mmcorpus
– Corpus in Matrix Market format
corpora._mmreader
– Reader for corpus in the Matrix Market format.
corpora.sharded_corpus
– Corpus stored in separate files
corpora.svmlightcorpus
– Corpus in SVMlight format
corpora.textcorpus
– Tools for building corpora with dictionaries
corpora.ucicorpus
– Corpus in UCI format
corpora.wikicorpus
– Corpus from a Wikipedia dump
models.ldamodel
– Latent Dirichlet Allocation
models.ldamulticore
– Parallelized Latent Dirichlet Allocation
models.nmf
– Non-Negative Matrix factorization
models.lsimodel
– Latent Semantic Indexing
models.ldaseqmodel
– Dynamic Topic Modeling in Python
models.tfidfmodel
– TF-IDF model
models.rpmodel
– Random Projections
models.hdpmodel
– Hierarchical Dirichlet Process
models.logentropy_model
– LogEntropy model
models.normmodel
– Normalization model
models.translation_matrix
– Translation Matrix model
models.lsi_dispatcher
– Dispatcher for distributed LSI
models.lsi_worker
– Worker for distributed LSI
models.lda_dispatcher
– Dispatcher for distributed LDA
models.lda_worker
– Worker for distributed LDA
models.atmodel
– Author-topic models
models.word2vec
– Word2vec embeddings
models.keyedvectors
– Store and query word vectors
models.doc2vec
– Doc2vec paragraph embeddings
models.fasttext
– FastText model
models._fasttext_bin
– Facebook I/O
models.phrases
– Phrase (collocation) detection
models.poincare
– Train and use Poincare embeddings
models.coherencemodel
– Topic coherence pipeline
models.basemodel
– Core TM interface
models.callbacks
– Callbacks for tracking and visualizing the LDA training process
models.utils_any2vec
– Utils for any2vec models
models._utils_any2vec
– Cython utils for any2vec models
models.word2vec_inner
– Cython routines for training Word2Vec models
models.doc2vec_inner
– Cython routines for training Doc2Vec models
models.fasttext_inner
– Cython routines for training FastText models
models.wrappers.ldamallet
– Latent Dirichlet Allocation via Mallet
models.wrappers.dtmmodel
– Dynamic Topic Models (DTM) and Dynamic Influence Models (DIM)
models.wrappers.ldavowpalwabbit
– Latent Dirichlet Allocation via Vowpal Wabbit
models.wrappers.wordrank
– Word Embeddings from WordRank
models.wrappers.varembed
– VarEmbed Word Embeddings
models.wrappers.fasttext
– Wrapper for FastText implementation from Facebook
models.deprecated.doc2vec
– Deep learning with paragraph2vec
models.deprecated.fasttext
– FastText model
models.deprecated.word2vec
– Deep learning with word2vec
models.deprecated.keyedvectors
– Store and query word vectors
models.deprecated.fasttext_wrapper
– Wrapper for Facebook implementation of FastText model
models.base_any2vec
– Base classes for any2vec models
similarities.docsim
– Document similarity queries
similarities.termsim
– Term similarity queries
similarities.index
– Fast Approximate Nearest Neighbor Similarity with Annoy package
sklearn_api.atmodel
– Scikit learn wrapper for Author-topic model
sklearn_api.d2vmodel
– Scikit learn wrapper for paragraph2vec model
sklearn_api.hdp
– Scikit learn wrapper for Hierarchical Dirichlet Process model
sklearn_api.ldamodel
– Scikit learn wrapper for Latent Dirichlet Allocation
sklearn_api.ldaseqmodel
– Scikit learn wrapper for LdaSeq model
sklearn_api.lsimodel
– Scikit learn wrapper for Latent Semantic Indexing
sklearn_api.phrases
– Scikit learn wrapper for phrase (collocation) detection
sklearn_api.rpmodel
– Scikit learn wrapper for Random Projection model
sklearn_api.text2bow
– Scikit learn wrapper for word<->id mapping
sklearn_api.tfidf
– Scikit learn wrapper for TF-IDF model
sklearn_api.w2vmodel
– Scikit learn wrapper for word2vec model
test.utils
– Common utils
topic_coherence.aggregation
– Aggregation module
topic_coherence.direct_confirmation_measure
– Direct confirmation measure module
topic_coherence.indirect_confirmation_measure
– Indirect confirmation measure module
topic_coherence.probability_estimation
– Probability estimation module
topic_coherence.segmentation
– Segmentation module
topic_coherence.text_analysis
– Analyzing the texts of a corpus to accumulate statistical information about word occurrences
scripts.package_info
– Information about gensim package
scripts.glove2word2vec
– Convert glove format to word2vec
scripts.make_wikicorpus
– Convert articles from a Wikipedia dump to vectors.
scripts.word2vec_standalone
– Train word2vec on text file CORPUS
scripts.make_wiki_online
– Convert articles from a Wikipedia dump
scripts.make_wiki_online_lemma
– Convert articles from a Wikipedia dump
scripts.make_wiki_online_nodebug
– Convert articles from a Wikipedia dump
scripts.word2vec2tensor
– Convert the word2vec format to Tensorflow 2D tensor
scripts.segment_wiki
– Convert wikipedia dump to json-line format
parsing.porter
– Porter Stemming Algorithm
parsing.preprocessing
– Functions to preprocess raw text
summarization.bm25
– BM25 ranking function
summarization.commons
– Common graph functions
summarization.graph
– Graph
summarization.keywords
– Keywords for TextRank summarization algorithm
summarization.mz_entropy
– Keywords for the Montemurro and Zanette entropy algorithm
summarization.pagerank_weighted
– Weighted PageRank algorithm
summarization.summarizer
– TextRank Summariser
summarization.syntactic_unit
– Syntactic Unit class
summarization.textcleaner
– Summarization pre-processing
viz.poincare
– Visualize Poincare embeddings
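
The names in the listing above read as dotted paths relative to the top-level gensim package, so corpora.dictionary would be imported as gensim.corpora.dictionary. Which of them are actually present depends on the installed gensim version. The following is a minimal sketch, not part of gensim, of how one might check availability before importing; the helper name is_available and its package parameter are hypothetical and introduced only for illustration.

```python
import importlib.util


def is_available(dotted_name, package="gensim"):
    """Return True if `<package>.<dotted_name>` can be located.

    Hypothetical helper for illustration only. Note that find_spec
    imports the parent packages in order to locate the submodule.
    """
    try:
        return importlib.util.find_spec(f"{package}.{dotted_name}") is not None
    except (ImportError, ValueError):
        # Raised when the package itself, or an intermediate subpackage,
        # is missing or cannot be imported in this environment.
        return False


# A few names taken from the listing; results vary by environment.
for name in ("corpora.dictionary", "models.ldamodel", "summarization.bm25"):
    status = "available" if is_available(name) else "not available"
    print(f"gensim.{name}: {status}")
```

The check returns False rather than raising when gensim is not installed, so it can be used to guard optional imports.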