WARNING:gensim.models.base_any2vec:consider setting layer size to a multiple of 4 for greater performance INFO:gensim.models.doc2vec:collecting all words and their counts INFO:gensim.models.doc2vec:PROGRESS: at example #0, processed 0 words (0/s), 0 word types, 0 tags INFO:gensim.models.doc2vec:PROGRESS: at example #10000, processed 1638877 words (6058817/s), 27231 word types, 10000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #20000, processed 3486871 words (5943450/s), 39421 word types, 20000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #30000, processed 5326561 words (5960371/s), 47476 word types, 30000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #40000, processed 6954694 words (5593355/s), 54460 word types, 40000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #50000, processed 8904632 words (5968779/s), 62365 word types, 50000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #60000, processed 10505114 words (5957991/s), 67804 word types, 60000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #70000, processed 12155100 words (5648731/s), 73470 word types, 70000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #80000, processed 13729279 words (5770286/s), 78646 word types, 80000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #90000, processed 15528714 words (6017030/s), 83100 word types, 90000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #100000, processed 17283639 words (5769887/s), 88127 word types, 100000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #110000, processed 18945521 words (5723990/s), 92375 word types, 110000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #120000, processed 20953191 words (5763773/s), 98760 word types, 120000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #130000, processed 22540885 words (5665509/s), 102066 word types, 130000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #140000, processed 24146818 words (5791711/s), 105790 word types, 140000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #150000, processed 25903502 words (5759098/s), 110273 word types, 150000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #160000, processed 27435945 words (5726733/s), 113636 word types, 160000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #170000, processed 29085450 words (5667576/s), 117431 word types, 170000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #180000, processed 30861487 words (5897347/s), 120663 word types, 180000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #190000, processed 32376671 words (6048247/s), 124007 word types, 190000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #200000, processed 34379365 words (6020216/s), 128684 word types, 200000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #210000, processed 35891507 words (5929570/s), 131439 word types, 210000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #220000, processed 37535893 words (5985521/s), 134697 word types, 220000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #230000, processed 39131270 words (6085967/s), 137322 word types, 230000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #240000, processed 40717325 words (5867743/s), 139192 word types, 240000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #250000, processed 42572565 words (6188013/s), 141779 word types, 250000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #260000, processed 44419180 words (6063542/s), 145015 word types, 260000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #270000, processed 46067880 words (5988106/s), 147808 word types, 270000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #280000, processed 47693239 words (6041152/s), 150562 word types, 280000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #290000, processed 49533790 words (6162600/s), 153273 word types, 290000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #300000, processed 51077080 words (6041972/s), 155991 word types, 300000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #310000, processed 53224237 words (6215776/s), 159463 word types, 310000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #320000, processed 54817392 words (5843785/s), 161720 word types, 320000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #330000, processed 56715869 words (6132342/s), 164055 word types, 330000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #340000, processed 58395961 words (6023433/s), 166608 word types, 340000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #350000, processed 59902886 words (5961109/s), 169337 word types, 350000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #360000, processed 61350248 words (5899266/s), 171522 word types, 360000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #370000, processed 62852949 words (5835886/s), 173479 word types, 370000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #380000, processed 64487146 words (5869965/s), 175560 word types, 380000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #390000, processed 66531419 words (5524650/s), 178354 word types, 390000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #400000, processed 68558739 words (5395440/s), 182266 word types, 400000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #410000, processed 70343735 words (5405627/s), 183884 word types, 410000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #420000, processed 71807029 words (5314736/s), 186067 word types, 420000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #430000, processed 73320316 words (5257322/s), 188430 word types, 430000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #440000, processed 74853481 words (5649270/s), 190648 word types, 440000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #450000, processed 76486403 words (5807216/s), 193272 word types, 450000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #460000, processed 78051614 words (5983653/s), 195296 word types, 460000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #470000, processed 79664741 words (5887664/s), 197494 word types, 470000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #480000, processed 81962841 words (5898756/s), 200221 word types, 480000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #490000, processed 83396256 words (5842752/s), 202218 word types, 490000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #500000, processed 85298843 words (5985972/s), 204406 word types, 500000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #510000, processed 87010354 words (5738274/s), 206627 word types, 510000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #520000, processed 88609811 words (5730965/s), 208497 word types, 520000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #530000, processed 90092007 words (5548682/s), 210131 word types, 530000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #540000, processed 91559140 words (5882966/s), 211907 word types, 540000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #550000, processed 93029606 words (5848652/s), 213679 word types, 550000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #560000, processed 94995419 words (6106890/s), 216167 word types, 560000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #570000, processed 97365266 words (6136031/s), 218693 word types, 570000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #580000, processed 99131545 words (5823683/s), 220213 word types, 580000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #590000, processed 101029975 words (5522020/s), 221949 word types, 590000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #600000, processed 102680265 words (5895179/s), 223513 word types, 600000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #610000, processed 104580617 words (5991416/s), 226166 word types, 610000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #620000, processed 106082752 words (5928552/s), 227587 word types, 620000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #630000, processed 107897316 words (6017238/s), 229717 word types, 630000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #640000, processed 109903329 words (5960734/s), 232012 word types, 640000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #650000, processed 111485930 words (5961369/s), 233686 word types, 650000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #660000, processed 113011922 words (6010396/s), 235588 word types, 660000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #670000, processed 114729171 words (6048883/s), 237279 word types, 670000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #680000, processed 116450871 words (6018002/s), 238885 word types, 680000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #690000, processed 118061949 words (5922482/s), 240527 word types, 690000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #700000, processed 119839279 words (6036249/s), 242713 word types, 700000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #710000, processed 122360873 words (6135490/s), 246317 word types, 710000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #720000, processed 124254832 words (5996788/s), 248531 word types, 720000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #730000, processed 126059567 words (6047883/s), 250775 word types, 730000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #740000, processed 127752034 words (6046122/s), 252483 word types, 740000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #750000, processed 129547076 words (6026196/s), 254364 word types, 750000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #760000, processed 131184884 words (6011311/s), 255853 word types, 760000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #770000, processed 132748586 words (5969905/s), 257522 word types, 770000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #780000, processed 134480007 words (6078933/s), 258992 word types, 780000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #790000, processed 136097760 words (6044623/s), 260269 word types, 790000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #800000, processed 137789152 words (5978629/s), 261293 word types, 800000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #810000, processed 139541509 words (6013304/s), 262955 word types, 810000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #820000, processed 141142653 words (5941385/s), 264018 word types, 820000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #830000, processed 142773531 words (6060622/s), 265293 word types, 830000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #840000, processed 144446025 words (5491733/s), 267522 word types, 840000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #850000, processed 145955862 words (5694335/s), 269127 word types, 850000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #860000, processed 147410188 words (5970138/s), 270245 word types, 860000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #870000, processed 149040518 words (5920498/s), 271463 word types, 870000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #880000, processed 150642896 words (6017959/s), 272716 word types, 880000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #890000, processed 152255380 words (6006283/s), 273848 word types, 890000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #900000, processed 153832870 words (5981867/s), 275186 word types, 900000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #910000, processed 155513857 words (5905310/s), 277084 word types, 910000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #920000, processed 157398037 words (5959351/s), 279157 word types, 920000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #930000, processed 159357845 words (6025168/s), 281444 word types, 930000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #940000, processed 160988419 words (5802634/s), 283100 word types, 940000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #950000, processed 162505547 words (5587221/s), 284256 word types, 950000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #960000, processed 164189140 words (5740618/s), 285693 word types, 960000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #970000, processed 165978334 words (5849367/s), 286851 word types, 970000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #980000, processed 167474748 words (5777903/s), 288109 word types, 980000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #990000, processed 169015976 words (5725017/s), 289358 word types, 990000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1000000, processed 170727786 words (5940898/s), 291019 word types, 1000000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1010000, processed 172276215 words (5560945/s), 292047 word types, 1010000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1020000, processed 173835504 words (5447323/s), 293097 word types, 1020000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1030000, processed 175376928 words (5403980/s), 294283 word types, 1030000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1040000, processed 177389469 words (5668908/s), 295588 word types, 1040000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1050000, processed 178978766 words (5647964/s), 296828 word types, 1050000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1060000, processed 180650935 words (5549979/s), 298047 word types, 1060000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1070000, processed 182444913 words (5655765/s), 300049 word types, 1070000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1080000, processed 183998850 words (5791116/s), 301962 word types, 1080000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1090000, processed 185730550 words (5699414/s), 303462 word types, 1090000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1100000, processed 187275768 words (5751674/s), 304403 word types, 1100000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1110000, processed 188988161 words (5937702/s), 305038 word types, 1110000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1120000, processed 190563278 words (5798007/s), 306308 word types, 1120000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1130000, processed 192360392 words (5952165/s), 307211 word types, 1130000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1140000, processed 193995780 words (5976002/s), 308721 word types, 1140000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1150000, processed 195687275 words (5809723/s), 310501 word types, 1150000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1160000, processed 197262490 words (5670448/s), 311594 word types, 1160000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1170000, processed 198886961 words (5200422/s), 312667 word types, 1170000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1180000, processed 200331440 words (5391425/s), 314188 word types, 1180000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1190000, processed 202085402 words (5921727/s), 315643 word types, 1190000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1200000, processed 203613359 words (5691907/s), 316749 word types, 1200000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1210000, processed 205092938 words (5389088/s), 318006 word types, 1210000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1220000, processed 206682162 words (5420717/s), 319491 word types, 1220000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1230000, processed 208331878 words (5476291/s), 320597 word types, 1230000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1240000, processed 209855183 words (5466446/s), 322014 word types, 1240000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1250000, processed 211722518 words (5557862/s), 323253 word types, 1250000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1260000, processed 213677052 words (5579818/s), 324750 word types, 1260000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1270000, processed 215443450 words (5330505/s), 325440 word types, 1270000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1280000, processed 217215129 words (5402038/s), 326935 word types, 1280000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1290000, processed 218688739 words (5570228/s), 327872 word types, 1290000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1300000, processed 220149483 words (5404106/s), 329265 word types, 1300000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1310000, processed 222110259 words (5974683/s), 331108 word types, 1310000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1320000, processed 223649280 words (5734143/s), 332146 word types, 1320000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1330000, processed 225225634 words (5898797/s), 333064 word types, 1330000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1340000, processed 226801190 words (5945796/s), 333791 word types, 1340000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1350000, processed 228486299 words (5641418/s), 334723 word types, 1350000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1360000, processed 230173355 words (5887537/s), 335751 word types, 1360000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1370000, processed 231855180 words (5865024/s), 337200 word types, 1370000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1380000, processed 233444240 words (5770100/s), 338209 word types, 1380000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1390000, processed 235104624 words (5868363/s), 339076 word types, 1390000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1400000, processed 236992722 words (5993932/s), 340182 word types, 1400000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1410000, processed 238468777 words (5731969/s), 341582 word types, 1410000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1420000, processed 240125966 words (5706673/s), 342397 word types, 1420000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1430000, processed 241807027 words (5682035/s), 343640 word types, 1430000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1440000, processed 243665202 words (5728528/s), 345340 word types, 1440000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1450000, processed 245283888 words (5736648/s), 346397 word types, 1450000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1460000, processed 246769469 words (5499809/s), 347304 word types, 1460000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1470000, processed 248438764 words (5742902/s), 348765 word types, 1470000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1480000, processed 250087004 words (5593897/s), 349765 word types, 1480000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1490000, processed 251857794 words (5834715/s), 350938 word types, 1490000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1500000, processed 254235138 words (5877812/s), 352628 word types, 1500000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1510000, processed 255795938 words (5287635/s), 353481 word types, 1510000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1520000, processed 257257585 words (5295414/s), 354453 word types, 1520000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1530000, processed 258825056 words (5387591/s), 355776 word types, 1530000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1540000, processed 260445564 words (5402810/s), 356340 word types, 1540000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1550000, processed 262138951 words (5866387/s), 357227 word types, 1550000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1560000, processed 263999652 words (5846907/s), 358455 word types, 1560000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1570000, processed 265620639 words (5430382/s), 359435 word types, 1570000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1580000, processed 267192282 words (5378386/s), 360387 word types, 1580000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1590000, processed 268875992 words (5390197/s), 361587 word types, 1590000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1600000, processed 270457838 words (5383528/s), 362452 word types, 1600000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1610000, processed 272004157 words (5370347/s), 363179 word types, 1610000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1620000, processed 273601207 words (5395440/s), 364034 word types, 1620000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1630000, processed 274681026 words (5323788/s), 366261 word types, 1630000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1640000, processed 275674939 words (5547738/s), 368645 word types, 1640000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1650000, processed 276668117 words (5366979/s), 370919 word types, 1650000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1660000, processed 277668016 words (5521627/s), 372860 word types, 1660000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1670000, processed 278662048 words (5454620/s), 374838 word types, 1670000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1680000, processed 279657809 words (5442379/s), 376589 word types, 1680000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1690000, processed 280651996 words (5532796/s), 378373 word types, 1690000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1700000, processed 281652758 words (5525959/s), 380097 word types, 1700000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1710000, processed 282646166 words (5472359/s), 381649 word types, 1710000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1720000, processed 283670041 words (5548798/s), 383271 word types, 1720000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1730000, processed 284663946 words (5480987/s), 384865 word types, 1730000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1740000, processed 285670017 words (5570807/s), 386476 word types, 1740000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1750000, processed 286671173 words (5576296/s), 388025 word types, 1750000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1760000, processed 287677653 words (5561974/s), 389517 word types, 1760000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1770000, processed 288690882 words (5498660/s), 390916 word types, 1770000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1780000, processed 289696646 words (5483239/s), 392283 word types, 1780000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1790000, processed 290692733 words (5503103/s), 393754 word types, 1790000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1800000, processed 291702379 words (5514418/s), 395164 word types, 1800000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1810000, processed 292717862 words (5476000/s), 396627 word types, 1810000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1820000, processed 293714420 words (5554086/s), 397982 word types, 1820000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1830000, processed 294725963 words (5400001/s), 399307 word types, 1830000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1840000, processed 295731560 words (5164997/s), 400661 word types, 1840000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1850000, processed 296741218 words (5332508/s), 401929 word types, 1850000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1860000, processed 297746075 words (5438557/s), 403183 word types, 1860000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1870000, processed 298754648 words (5432810/s), 404502 word types, 1870000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1880000, processed 299756989 words (5448977/s), 405718 word types, 1880000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1890000, processed 300761539 words (5450991/s), 406983 word types, 1890000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1900000, processed 301751608 words (5452316/s), 408319 word types, 1900000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1910000, processed 302767714 words (5495656/s), 409598 word types, 1910000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1920000, processed 303761690 words (5482692/s), 410755 word types, 1920000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1930000, processed 304765038 words (5508205/s), 412064 word types, 1930000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1940000, processed 305782636 words (5450636/s), 413308 word types, 1940000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1950000, processed 306790814 words (5346354/s), 414510 word types, 1950000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1960000, processed 307805829 words (5267669/s), 415813 word types, 1960000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1970000, processed 308803399 words (5162756/s), 416835 word types, 1970000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1980000, processed 309791087 words (5144426/s), 417957 word types, 1980000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #1990000, processed 310794131 words (5176224/s), 419012 word types, 1990000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2000000, processed 311801296 words (5145468/s), 420207 word types, 2000000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2010000, processed 312809891 words (5501326/s), 421467 word types, 2010000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2020000, processed 313819224 words (5520007/s), 422577 word types, 2020000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2030000, processed 314853380 words (5493920/s), 423756 word types, 2030000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2040000, processed 315869757 words (5430803/s), 424914 word types, 2040000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2050000, processed 316890731 words (5279199/s), 425992 word types, 2050000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2060000, processed 317904002 words (4629452/s), 427193 word types, 2060000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2070000, processed 318920796 words (4769405/s), 428286 word types, 2070000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2080000, processed 319960622 words (5307049/s), 429491 word types, 2080000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2090000, processed 321008080 words (4929540/s), 430761 word types, 2090000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2100000, processed 322061326 words (5130974/s), 432019 word types, 2100000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2110000, processed 323095612 words (5310026/s), 433205 word types, 2110000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2120000, processed 324156800 words (4958928/s), 434446 word types, 2120000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2130000, processed 325198491 words (4845432/s), 435530 word types, 2130000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2140000, processed 326231702 words (4606879/s), 436660 word types, 2140000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2150000, processed 327287666 words (4921788/s), 437731 word types, 2150000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2160000, processed 328330478 words (5073468/s), 438843 word types, 2160000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2170000, processed 329377981 words (5135675/s), 439961 word types, 2170000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2180000, processed 330425742 words (5179396/s), 441046 word types, 2180000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2190000, processed 331457558 words (5243335/s), 442197 word types, 2190000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2200000, processed 332508787 words (5319442/s), 443294 word types, 2200000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2210000, processed 333566893 words (5282885/s), 444402 word types, 2210000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2220000, processed 334632680 words (5229566/s), 445469 word types, 2220000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2230000, processed 335689334 words (4742974/s), 446580 word types, 2230000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2240000, processed 336739648 words (4904958/s), 447653 word types, 2240000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2250000, processed 337788638 words (4853760/s), 448731 word types, 2250000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2260000, processed 338855519 words (5112412/s), 449800 word types, 2260000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2270000, processed 339924938 words (5231538/s), 450866 word types, 2270000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2280000, processed 340985609 words (5366028/s), 451892 word types, 2280000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2290000, processed 342051146 words (5540755/s), 452913 word types, 2290000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2300000, processed 343132023 words (5413500/s), 453988 word types, 2300000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2310000, processed 344201849 words (5412923/s), 455040 word types, 2310000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2320000, processed 345275531 words (5299756/s), 456183 word types, 2320000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2330000, processed 346341979 words (5293329/s), 457385 word types, 2330000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2340000, processed 347397455 words (5385997/s), 458400 word types, 2340000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2350000, processed 348473256 words (5321136/s), 459387 word types, 2350000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2360000, processed 349550718 words (5487016/s), 460325 word types, 2360000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2370000, processed 350590109 words (5516552/s), 461341 word types, 2370000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2380000, processed 351654983 words (5418686/s), 462335 word types, 2380000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2390000, processed 352708841 words (5424300/s), 463350 word types, 2390000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2400000, processed 353731894 words (5267530/s), 464501 word types, 2400000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2410000, processed 354721583 words (5348599/s), 465609 word types, 2410000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2420000, processed 355721885 words (5348534/s), 466737 word types, 2420000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2430000, processed 356723950 words (5310311/s), 467778 word types, 2430000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2440000, processed 357730929 words (5102548/s), 468974 word types, 2440000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2450000, processed 358715077 words (4991022/s), 470112 word types, 2450000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2460000, processed 359702621 words (4665797/s), 471183 word types, 2460000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2470000, processed 360695920 words (4962638/s), 472147 word types, 2470000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2480000, processed 361710618 words (4979416/s), 473167 word types, 2480000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2490000, processed 362716232 words (4829097/s), 474220 word types, 2490000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2500000, processed 363729315 words (4845208/s), 475214 word types, 2500000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2510000, processed 364711421 words (4742858/s), 476203 word types, 2510000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2520000, processed 365706696 words (4941142/s), 477110 word types, 2520000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2530000, processed 366708134 words (4959402/s), 478069 word types, 2530000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2540000, processed 367701287 words (4927982/s), 479034 word types, 2540000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2550000, processed 368706094 words (4739652/s), 479957 word types, 2550000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2560000, processed 369722544 words (4941851/s), 480884 word types, 2560000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2570000, processed 370724952 words (4923902/s), 481794 word types, 2570000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2580000, processed 371731611 words (4945382/s), 482645 word types, 2580000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2590000, processed 372735869 words (5261595/s), 483695 word types, 2590000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2600000, processed 373733952 words (5311643/s), 484605 word types, 2600000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2610000, processed 374738713 words (5429108/s), 485470 word types, 2610000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2620000, processed 375734314 words (5272923/s), 486393 word types, 2620000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2630000, processed 376737822 words (4978347/s), 487319 word types, 2630000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2640000, processed 377740240 words (5139740/s), 488284 word types, 2640000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2650000, processed 378740038 words (5276870/s), 489182 word types, 2650000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2660000, processed 379752011 words (5020131/s), 490082 word types, 2660000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2670000, processed 380752626 words (4244127/s), 490931 word types, 2670000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2680000, processed 381760382 words (4963171/s), 491824 word types, 2680000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2690000, processed 382764627 words (5452834/s), 492603 word types, 2690000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2700000, processed 383779760 words (4547649/s), 493531 word types, 2700000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2710000, processed 384769862 words (4992721/s), 494453 word types, 2710000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2720000, processed 385764093 words (5339094/s), 495366 word types, 2720000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2730000, processed 386771805 words (5411057/s), 496263 word types, 2730000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2740000, processed 387764145 words (5462608/s), 497126 word types, 2740000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2750000, processed 388773944 words (5247957/s), 497993 word types, 2750000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2760000, processed 389774974 words (4543045/s), 498844 word types, 2760000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2770000, processed 390787549 words (4714287/s), 499727 word types, 2770000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2780000, processed 391788468 words (4884899/s), 500642 word types, 2780000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2790000, processed 392785959 words (4925234/s), 501548 word types, 2790000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2800000, processed 393792263 words (5109939/s), 502446 word types, 2800000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2810000, processed 394800324 words (5064472/s), 503247 word types, 2810000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2820000, processed 395812650 words (4763386/s), 504029 word types, 2820000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2830000, processed 396837349 words (4788593/s), 504922 word types, 2830000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2840000, processed 397854129 words (5328128/s), 505790 word types, 2840000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2850000, processed 398864365 words (4953683/s), 506560 word types, 2850000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2860000, processed 399896483 words (5267659/s), 507471 word types, 2860000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2870000, processed 400927294 words (5372506/s), 508337 word types, 2870000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2880000, processed 401974138 words (5458747/s), 509194 word types, 2880000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2890000, processed 403024202 words (5507738/s), 510126 word types, 2890000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2900000, processed 404074688 words (5448672/s), 511165 word types, 2900000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2910000, processed 405112523 words (5425080/s), 512065 word types, 2910000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2920000, processed 406176134 words (5386296/s), 512946 word types, 2920000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2930000, processed 407240197 words (5183988/s), 513918 word types, 2930000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2940000, processed 408307560 words (5389658/s), 515075 word types, 2940000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2950000, processed 409358740 words (5435401/s), 516013 word types, 2950000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2960000, processed 410429390 words (5490132/s), 516908 word types, 2960000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2970000, processed 411488684 words (5429302/s), 517798 word types, 2970000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2980000, processed 412545671 words (5551973/s), 518722 word types, 2980000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #2990000, processed 413594663 words (5444174/s), 519637 word types, 2990000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3000000, processed 414652059 words (5435070/s), 520490 word types, 3000000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3010000, processed 415706328 words (5406397/s), 521231 word types, 3010000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3020000, processed 416769314 words (5404959/s), 522120 word types, 3020000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3030000, processed 417817605 words (5410815/s), 522922 word types, 3030000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3040000, processed 418859128 words (5326679/s), 523683 word types, 3040000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3050000, processed 419931507 words (5378694/s), 524658 word types, 3050000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3060000, processed 420986100 words (5471777/s), 525526 word types, 3060000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3070000, processed 422067464 words (5449325/s), 526496 word types, 3070000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3080000, processed 423123061 words (5025946/s), 527398 word types, 3080000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3090000, processed 424207489 words (5060570/s), 528315 word types, 3090000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3100000, processed 425271124 words (5401639/s), 529234 word types, 3100000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3110000, processed 426326758 words (5548402/s), 530067 word types, 3110000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3120000, processed 427390291 words (5456189/s), 530788 word types, 3120000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3130000, processed 428434489 words (5474430/s), 531606 word types, 3130000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3140000, processed 429494983 words (5530686/s), 532445 word types, 3140000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3150000, processed 430564252 words (5490069/s), 533208 word types, 3150000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3160000, processed 431624551 words (5471394/s), 534139 word types, 3160000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3170000, processed 432682696 words (5461229/s), 534942 word types, 3170000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3180000, processed 433755599 words (5566583/s), 535710 word types, 3180000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3190000, processed 434820856 words (5577723/s), 536517 word types, 3190000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3200000, processed 435847746 words (5397780/s), 537440 word types, 3200000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3210000, processed 436837743 words (5401904/s), 538447 word types, 3210000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3220000, processed 437837636 words (5277075/s), 539308 word types, 3220000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3230000, processed 438833055 words (5474803/s), 540224 word types, 3230000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3240000, processed 439835056 words (5472714/s), 541152 word types, 3240000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3250000, processed 440818241 words (5537148/s), 541960 word types, 3250000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3260000, processed 441806340 words (5334839/s), 542884 word types, 3260000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3270000, processed 442816975 words (5249144/s), 543736 word types, 3270000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3280000, processed 443817487 words (4763949/s), 544651 word types, 3280000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3290000, processed 444805273 words (4971777/s), 545441 word types, 3290000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3300000, processed 445805420 words (5056865/s), 546204 word types, 3300000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3310000, processed 446819880 words (5228725/s), 547007 word types, 3310000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3320000, processed 447809301 words (5407232/s), 547869 word types, 3320000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3330000, processed 448798128 words (5424291/s), 548722 word types, 3330000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3340000, processed 449810808 words (5476098/s), 549514 word types, 3340000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3350000, processed 450816890 words (5439188/s), 550370 word types, 3350000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3360000, processed 451808851 words (5483115/s), 551150 word types, 3360000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3370000, processed 452811572 words (5447433/s), 552143 word types, 3370000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3380000, processed 453819450 words (4981236/s), 552956 word types, 3380000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3390000, processed 454834163 words (5132774/s), 553789 word types, 3390000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3400000, processed 455838155 words (5343391/s), 554595 word types, 3400000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3410000, processed 456831576 words (5404707/s), 555325 word types, 3410000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3420000, processed 457833713 words (5447777/s), 556147 word types, 3420000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3430000, processed 458826978 words (5411811/s), 556835 word types, 3430000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3440000, processed 459827767 words (5337022/s), 557576 word types, 3440000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3450000, processed 460820864 words (5336096/s), 558326 word types, 3450000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3460000, processed 461812567 words (5181049/s), 558996 word types, 3460000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3470000, processed 462816167 words (5279983/s), 559770 word types, 3470000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3480000, processed 463823346 words (5302477/s), 560547 word types, 3480000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3490000, processed 464833492 words (5151022/s), 561348 word types, 3490000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3500000, processed 465842166 words (5041244/s), 562107 word types, 3500000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3510000, processed 466839780 words (4935419/s), 562869 word types, 3510000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3520000, processed 467849917 words (4708290/s), 563631 word types, 3520000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3530000, processed 468842046 words (5134017/s), 564386 word types, 3530000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3540000, processed 469855638 words (5262261/s), 565143 word types, 3540000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3550000, processed 470863473 words (5081773/s), 565915 word types, 3550000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3560000, processed 471867935 words (5014115/s), 566642 word types, 3560000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3570000, processed 472888730 words (4624103/s), 567370 word types, 3570000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3580000, processed 473902690 words (4898599/s), 568153 word types, 3580000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3590000, processed 474911503 words (4884487/s), 568918 word types, 3590000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3600000, processed 475909151 words (4822210/s), 569727 word types, 3600000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3610000, processed 476948396 words (5303061/s), 570490 word types, 3610000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3620000, processed 477966975 words (5395761/s), 571273 word types, 3620000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3630000, processed 479001036 words (5427516/s), 572044 word types, 3630000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3640000, processed 480025216 words (5051091/s), 572777 word types, 3640000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3650000, processed 481031064 words (4944508/s), 573526 word types, 3650000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3660000, processed 482036650 words (4745854/s), 574292 word types, 3660000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3670000, processed 483069552 words (4883630/s), 575085 word types, 3670000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3680000, processed 484108996 words (4896981/s), 575758 word types, 3680000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3690000, processed 485143992 words (5340986/s), 576510 word types, 3690000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3700000, processed 486173122 words (5052789/s), 577314 word types, 3700000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3710000, processed 487216880 words (4878594/s), 578047 word types, 3710000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3720000, processed 488263698 words (4784715/s), 578891 word types, 3720000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3730000, processed 489320042 words (5200444/s), 579727 word types, 3730000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3740000, processed 490368053 words (5195773/s), 580481 word types, 3740000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3750000, processed 491408735 words (4858225/s), 581277 word types, 3750000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3760000, processed 492480025 words (4649870/s), 582113 word types, 3760000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3770000, processed 493532392 words (4930409/s), 582903 word types, 3770000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3780000, processed 494571090 words (4883164/s), 583778 word types, 3780000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3790000, processed 495615830 words (4908362/s), 584608 word types, 3790000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3800000, processed 496645724 words (4907516/s), 585448 word types, 3800000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3810000, processed 497696683 words (4874503/s), 586351 word types, 3810000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3820000, processed 498750616 words (4954567/s), 587095 word types, 3820000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3830000, processed 499811560 words (4811265/s), 587892 word types, 3830000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3840000, processed 500865757 words (4962775/s), 588615 word types, 3840000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3850000, processed 501921318 words (4651237/s), 589273 word types, 3850000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3860000, processed 502973901 words (4954366/s), 590101 word types, 3860000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3870000, processed 504029913 words (5260207/s), 590810 word types, 3870000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3880000, processed 505091914 words (4935305/s), 591601 word types, 3880000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3890000, processed 506158228 words (4949797/s), 592318 word types, 3890000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3900000, processed 507223434 words (4769633/s), 593064 word types, 3900000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3910000, processed 508296882 words (4946888/s), 593842 word types, 3910000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3920000, processed 509371046 words (5054871/s), 594635 word types, 3920000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3930000, processed 510450891 words (5077069/s), 595379 word types, 3930000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3940000, processed 511522802 words (5313646/s), 596196 word types, 3940000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3950000, processed 512596193 words (4946011/s), 596982 word types, 3950000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3960000, processed 513688719 words (4716251/s), 597868 word types, 3960000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3970000, processed 514785261 words (4853815/s), 598664 word types, 3970000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3980000, processed 515875935 words (4861599/s), 599511 word types, 3980000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #3990000, processed 516947457 words (4648574/s), 600252 word types, 3990000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4000000, processed 517962656 words (4877455/s), 601109 word types, 4000000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4010000, processed 518972670 words (5287748/s), 601919 word types, 4010000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4020000, processed 519966583 words (5260859/s), 602701 word types, 4020000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4030000, processed 520966646 words (5350983/s), 603459 word types, 4030000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4040000, processed 521962881 words (5281338/s), 604227 word types, 4040000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4050000, processed 522935595 words (5310170/s), 604943 word types, 4050000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4060000, processed 523922094 words (5162168/s), 605891 word types, 4060000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4070000, processed 524926177 words (4939418/s), 606593 word types, 4070000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4080000, processed 525925733 words (4799920/s), 607381 word types, 4080000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4090000, processed 526910996 words (4888228/s), 608063 word types, 4090000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4100000, processed 527915213 words (4802783/s), 608831 word types, 4100000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4110000, processed 528944605 words (4882393/s), 609509 word types, 4110000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4120000, processed 529942192 words (5064257/s), 610241 word types, 4120000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4130000, processed 530936018 words (4971123/s), 610947 word types, 4130000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4140000, processed 531944409 words (4838592/s), 611664 word types, 4140000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4150000, processed 532939000 words (4673881/s), 612406 word types, 4150000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4160000, processed 533965753 words (4478614/s), 613231 word types, 4160000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4170000, processed 534959540 words (4730488/s), 613968 word types, 4170000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4180000, processed 535973480 words (4833137/s), 614715 word types, 4180000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4190000, processed 536980130 words (5152374/s), 615400 word types, 4190000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4200000, processed 538001217 words (5301841/s), 616291 word types, 4200000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4210000, processed 539003508 words (5400224/s), 616977 word types, 4210000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4220000, processed 540001111 words (5366688/s), 617635 word types, 4220000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4230000, processed 540995746 words (5405705/s), 618413 word types, 4230000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4240000, processed 542005320 words (5426283/s), 619154 word types, 4240000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4250000, processed 542987029 words (5426814/s), 619836 word types, 4250000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4260000, processed 544008911 words (5413782/s), 620496 word types, 4260000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4270000, processed 544990384 words (5404626/s), 621142 word types, 4270000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4280000, processed 545998348 words (5411338/s), 621882 word types, 4280000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4290000, processed 547000573 words (5405361/s), 622526 word types, 4290000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4300000, processed 548002841 words (5326311/s), 623329 word types, 4300000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4310000, processed 549009831 words (5431874/s), 624064 word types, 4310000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4320000, processed 550008475 words (5373360/s), 624708 word types, 4320000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4330000, processed 551017477 words (5337343/s), 625406 word types, 4330000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4340000, processed 552014224 words (5351439/s), 626069 word types, 4340000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4350000, processed 553024336 words (5384684/s), 626751 word types, 4350000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4360000, processed 554007018 words (5196001/s), 627386 word types, 4360000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4370000, processed 554996076 words (5374334/s), 628061 word types, 4370000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4380000, processed 556000689 words (5352084/s), 628784 word types, 4380000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4390000, processed 557018662 words (5233263/s), 629438 word types, 4390000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4400000, processed 558036285 words (5128478/s), 630067 word types, 4400000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4410000, processed 559049308 words (4889709/s), 630706 word types, 4410000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4420000, processed 560060391 words (4955247/s), 631410 word types, 4420000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4430000, processed 561092097 words (4984528/s), 632100 word types, 4430000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4440000, processed 562131413 words (4940332/s), 632834 word types, 4440000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4450000, processed 563149074 words (4893309/s), 633500 word types, 4450000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4460000, processed 564182172 words (5021116/s), 634172 word types, 4460000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4470000, processed 565203779 words (5054172/s), 634911 word types, 4470000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4480000, processed 566236634 words (4946419/s), 635654 word types, 4480000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4490000, processed 567275423 words (4724744/s), 636481 word types, 4490000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4500000, processed 568309858 words (5129185/s), 637201 word types, 4500000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4510000, processed 569350746 words (5332635/s), 637986 word types, 4510000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4520000, processed 570383244 words (5351918/s), 638611 word types, 4520000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4530000, processed 571425758 words (5353822/s), 639376 word types, 4530000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4540000, processed 572479354 words (5225996/s), 640171 word types, 4540000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4550000, processed 573531343 words (5243583/s), 640826 word types, 4550000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4560000, processed 574555791 words (5424866/s), 641475 word types, 4560000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4570000, processed 575596203 words (5387115/s), 642231 word types, 4570000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4580000, processed 576637061 words (5458444/s), 643025 word types, 4580000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4590000, processed 577659622 words (5323677/s), 643681 word types, 4590000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4600000, processed 578682045 words (5254191/s), 644333 word types, 4600000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4610000, processed 579710719 words (5412019/s), 645028 word types, 4610000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4620000, processed 580795249 words (5519677/s), 645708 word types, 4620000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4630000, processed 581854995 words (5413280/s), 646368 word types, 4630000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4640000, processed 582931265 words (5478009/s), 647143 word types, 4640000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4650000, processed 583983097 words (5326768/s), 647859 word types, 4650000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4660000, processed 585055006 words (5423959/s), 648594 word types, 4660000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4670000, processed 586116688 words (5341190/s), 649223 word types, 4670000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4680000, processed 587180235 words (5319931/s), 649890 word types, 4680000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4690000, processed 588243368 words (5342745/s), 650590 word types, 4690000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4700000, processed 589311438 words (5346651/s), 651371 word types, 4700000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4710000, processed 590384404 words (5063364/s), 652086 word types, 4710000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4720000, processed 591446010 words (4946494/s), 652764 word types, 4720000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4730000, processed 592514100 words (4868769/s), 653427 word types, 4730000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4740000, processed 593590131 words (4959738/s), 654091 word types, 4740000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4750000, processed 594650778 words (4908366/s), 654800 word types, 4750000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4760000, processed 595701238 words (5088508/s), 655564 word types, 4760000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4770000, processed 596774846 words (5398431/s), 656129 word types, 4770000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4780000, processed 597824744 words (5308494/s), 656844 word types, 4780000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4790000, processed 598880988 words (5207477/s), 657567 word types, 4790000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4800000, processed 599954836 words (5263043/s), 658236 word types, 4800000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4810000, processed 601034148 words (5326213/s), 658963 word types, 4810000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4820000, processed 602099471 words (5243716/s), 659623 word types, 4820000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4830000, processed 603199745 words (5218370/s), 660270 word types, 4830000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4840000, processed 604326930 words (5393397/s), 661003 word types, 4840000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4850000, processed 605338494 words (4818877/s), 661705 word types, 4850000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4860000, processed 606322907 words (5159444/s), 662425 word types, 4860000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4870000, processed 607291964 words (5318375/s), 663209 word types, 4870000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4880000, processed 608280964 words (5176142/s), 664031 word types, 4880000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4890000, processed 609260954 words (5319291/s), 664781 word types, 4890000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4900000, processed 610254916 words (5063643/s), 665452 word types, 4900000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4910000, processed 611246962 words (5283467/s), 666136 word types, 4910000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4920000, processed 612246373 words (5291933/s), 666857 word types, 4920000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4930000, processed 613232330 words (5252544/s), 667526 word types, 4930000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4940000, processed 614205717 words (4809404/s), 668218 word types, 4940000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4950000, processed 615203549 words (4641432/s), 668931 word types, 4950000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4960000, processed 616198455 words (4840354/s), 669620 word types, 4960000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4970000, processed 617216172 words (4796710/s), 670244 word types, 4970000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4980000, processed 618207747 words (4737504/s), 670891 word types, 4980000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #4990000, processed 619196537 words (4788829/s), 671618 word types, 4990000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5000000, processed 620196224 words (4796603/s), 672232 word types, 5000000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5010000, processed 621193417 words (4945359/s), 672861 word types, 5010000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5020000, processed 622175991 words (4932853/s), 673476 word types, 5020000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5030000, processed 623158950 words (5434783/s), 674191 word types, 5030000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5040000, processed 624144042 words (5436057/s), 675023 word types, 5040000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5050000, processed 625132929 words (5272024/s), 675801 word types, 5050000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5060000, processed 626116161 words (5310861/s), 676478 word types, 5060000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5070000, processed 627114059 words (5462373/s), 677183 word types, 5070000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5080000, processed 628109355 words (5344826/s), 677867 word types, 5080000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5090000, processed 629087683 words (4949321/s), 678501 word types, 5090000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5100000, processed 630093150 words (4855979/s), 679133 word types, 5100000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5110000, processed 631068487 words (4849073/s), 679785 word types, 5110000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5120000, processed 632082198 words (4812066/s), 680493 word types, 5120000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5130000, processed 633058841 words (4862579/s), 681290 word types, 5130000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5140000, processed 634049257 words (4860629/s), 681884 word types, 5140000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5150000, processed 635047585 words (4902204/s), 682478 word types, 5150000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5160000, processed 636054314 words (4948375/s), 683127 word types, 5160000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5170000, processed 637062198 words (5215861/s), 683758 word types, 5170000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5180000, processed 638059145 words (5266510/s), 684341 word types, 5180000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5190000, processed 639076802 words (5083532/s), 684942 word types, 5190000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5200000, processed 640070040 words (4950672/s), 685555 word types, 5200000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5210000, processed 641063380 words (4974156/s), 686120 word types, 5210000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5220000, processed 642065516 words (4976764/s), 686812 word types, 5220000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5230000, processed 643057017 words (4949368/s), 687427 word types, 5230000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5240000, processed 644071099 words (5024735/s), 688056 word types, 5240000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5250000, processed 645056745 words (5079910/s), 688672 word types, 5250000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5260000, processed 646066586 words (4960620/s), 689287 word types, 5260000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5270000, processed 647092387 words (5488236/s), 689957 word types, 5270000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5280000, processed 648114173 words (5281438/s), 690619 word types, 5280000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5290000, processed 649151331 words (5006493/s), 691279 word types, 5290000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5300000, processed 650196882 words (5012787/s), 691921 word types, 5300000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5310000, processed 651228749 words (5139870/s), 692536 word types, 5310000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5320000, processed 652251977 words (5088983/s), 693234 word types, 5320000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5330000, processed 653281620 words (5502412/s), 693808 word types, 5330000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5340000, processed 654315317 words (5292626/s), 694546 word types, 5340000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5350000, processed 655341973 words (5094084/s), 695178 word types, 5350000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5360000, processed 656381594 words (5021817/s), 695839 word types, 5360000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5370000, processed 657415537 words (5049383/s), 696511 word types, 5370000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5380000, processed 658464396 words (4993632/s), 697155 word types, 5380000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5390000, processed 659496918 words (5472871/s), 698158 word types, 5390000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5400000, processed 660531548 words (5414092/s), 698781 word types, 5400000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5410000, processed 661560356 words (4695765/s), 699418 word types, 5410000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5420000, processed 662596742 words (5349558/s), 700030 word types, 5420000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5430000, processed 663628450 words (5255594/s), 700619 word types, 5430000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5440000, processed 664680512 words (5255427/s), 701305 word types, 5440000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5450000, processed 665736502 words (5410337/s), 702061 word types, 5450000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5460000, processed 666803560 words (5347988/s), 702809 word types, 5460000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5470000, processed 667864570 words (5495483/s), 703501 word types, 5470000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5480000, processed 668933167 words (5483600/s), 704097 word types, 5480000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5490000, processed 669992074 words (5406423/s), 704714 word types, 5490000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5500000, processed 671057396 words (5432148/s), 705352 word types, 5500000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5510000, processed 672137776 words (5479639/s), 706047 word types, 5510000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5520000, processed 673201862 words (5504957/s), 706849 word types, 5520000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5530000, processed 674260379 words (5476994/s), 707505 word types, 5530000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5540000, processed 675342338 words (5430693/s), 708184 word types, 5540000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5550000, processed 676412828 words (5449230/s), 708780 word types, 5550000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5560000, processed 677487905 words (5386948/s), 709401 word types, 5560000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5570000, processed 678540429 words (5531331/s), 710009 word types, 5570000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5580000, processed 679615475 words (5502916/s), 710621 word types, 5580000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5590000, processed 680680534 words (5427429/s), 711355 word types, 5590000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5600000, processed 681746199 words (5330412/s), 712010 word types, 5600000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5610000, processed 682800620 words (5239866/s), 712659 word types, 5610000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5620000, processed 683873499 words (5410248/s), 713291 word types, 5620000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5630000, processed 684927709 words (5460177/s), 713851 word types, 5630000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5640000, processed 685980697 words (5351750/s), 714519 word types, 5640000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5650000, processed 687066505 words (5404139/s), 715187 word types, 5650000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5660000, processed 688111613 words (5256940/s), 715891 word types, 5660000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5670000, processed 689096371 words (5412249/s), 716655 word types, 5670000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5680000, processed 690074367 words (5472368/s), 717297 word types, 5680000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5690000, processed 691048870 words (5374843/s), 717988 word types, 5690000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5700000, processed 692052223 words (5164019/s), 718663 word types, 5700000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5710000, processed 693038736 words (4946020/s), 719300 word types, 5710000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5720000, processed 694040815 words (4994709/s), 719940 word types, 5720000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5730000, processed 695040838 words (5005188/s), 720562 word types, 5730000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5740000, processed 696046706 words (5124080/s), 721237 word types, 5740000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5750000, processed 697045040 words (5374661/s), 721825 word types, 5750000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5760000, processed 698055442 words (5377454/s), 722480 word types, 5760000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5770000, processed 699044975 words (5319934/s), 723189 word types, 5770000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5780000, processed 700033648 words (5319074/s), 723831 word types, 5780000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5790000, processed 701052411 words (5287880/s), 724465 word types, 5790000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5800000, processed 702052043 words (5341428/s), 725069 word types, 5800000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5810000, processed 703062691 words (5432678/s), 725702 word types, 5810000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5820000, processed 704075564 words (5472466/s), 726302 word types, 5820000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5830000, processed 705074127 words (5403627/s), 726939 word types, 5830000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5840000, processed 706082193 words (5433901/s), 727547 word types, 5840000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5850000, processed 707101473 words (5375477/s), 728153 word types, 5850000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5860000, processed 708110577 words (5448968/s), 728712 word types, 5860000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5870000, processed 709125034 words (5448875/s), 729296 word types, 5870000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5880000, processed 710127912 words (5485633/s), 729942 word types, 5880000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5890000, processed 711129682 words (5483717/s), 730575 word types, 5890000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5900000, processed 712122251 words (5435036/s), 731174 word types, 5900000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5910000, processed 713127298 words (5406719/s), 731752 word types, 5910000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5920000, processed 714144275 words (5464847/s), 732280 word types, 5920000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5930000, processed 715150669 words (5396902/s), 732883 word types, 5930000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5940000, processed 716157231 words (5470212/s), 733441 word types, 5940000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5950000, processed 717146388 words (5425877/s), 733993 word types, 5950000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5960000, processed 718143001 words (5323765/s), 734572 word types, 5960000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5970000, processed 719147251 words (5376681/s), 735176 word types, 5970000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5980000, processed 720153634 words (5267476/s), 735840 word types, 5980000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #5990000, processed 721155992 words (5461579/s), 736390 word types, 5990000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6000000, processed 722152226 words (5503964/s), 736901 word types, 6000000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6010000, processed 723161737 words (5464686/s), 737399 word types, 6010000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6020000, processed 724156518 words (5423333/s), 737911 word types, 6020000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6030000, processed 725166492 words (5423189/s), 738457 word types, 6030000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6040000, processed 726164593 words (5446223/s), 739103 word types, 6040000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6050000, processed 727170253 words (5492337/s), 739639 word types, 6050000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6060000, processed 728179411 words (5390706/s), 740260 word types, 6060000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6070000, processed 729176146 words (5426775/s), 740827 word types, 6070000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6080000, processed 730177212 words (5473864/s), 741447 word types, 6080000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6090000, processed 731204378 words (5403587/s), 742047 word types, 6090000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6100000, processed 732260247 words (5451961/s), 742774 word types, 6100000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6110000, processed 733309310 words (5430519/s), 743349 word types, 6110000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6120000, processed 734358704 words (5485779/s), 744040 word types, 6120000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6130000, processed 735429085 words (5473871/s), 744698 word types, 6130000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6140000, processed 736494589 words (5407362/s), 745316 word types, 6140000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6150000, processed 737546572 words (5414857/s), 745976 word types, 6150000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6160000, processed 738600312 words (5448595/s), 746573 word types, 6160000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6170000, processed 739655067 words (5076995/s), 747147 word types, 6170000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6180000, processed 740717599 words (5012990/s), 747733 word types, 6180000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6190000, processed 741764366 words (4997005/s), 748362 word types, 6190000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6200000, processed 742818172 words (5053209/s), 749018 word types, 6200000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6210000, processed 743850206 words (5048755/s), 749738 word types, 6210000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6220000, processed 744919915 words (5048527/s), 750370 word types, 6220000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6230000, processed 745968076 words (5013196/s), 751050 word types, 6230000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6240000, processed 747032588 words (5063603/s), 751671 word types, 6240000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6250000, processed 748075059 words (5078580/s), 752252 word types, 6250000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6260000, processed 749110460 words (5056101/s), 752871 word types, 6260000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6270000, processed 750169949 words (4974180/s), 753504 word types, 6270000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6280000, processed 751248230 words (4999301/s), 754186 word types, 6280000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6290000, processed 752329535 words (5052097/s), 754838 word types, 6290000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6300000, processed 753398402 words (5079854/s), 755490 word types, 6300000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6310000, processed 754475069 words (5060042/s), 756168 word types, 6310000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6320000, processed 755555107 words (4970851/s), 756776 word types, 6320000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6330000, processed 756616971 words (5006922/s), 757365 word types, 6330000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6340000, processed 757702358 words (5109154/s), 758018 word types, 6340000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6350000, processed 758780045 words (5116953/s), 758552 word types, 6350000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6360000, processed 759839285 words (5097360/s), 759138 word types, 6360000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6370000, processed 760913993 words (5114810/s), 759805 word types, 6370000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6380000, processed 761985118 words (5106319/s), 760520 word types, 6380000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6390000, processed 763074874 words (5147876/s), 761096 word types, 6390000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6400000, processed 764149557 words (5142320/s), 761739 word types, 6400000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6410000, processed 765216221 words (5177150/s), 762347 word types, 6410000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6420000, processed 766279513 words (5053824/s), 763041 word types, 6420000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6430000, processed 767352568 words (5275067/s), 763599 word types, 6430000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6440000, processed 768417313 words (5489043/s), 764324 word types, 6440000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6450000, processed 769411861 words (5450415/s), 764861 word types, 6450000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6460000, processed 770410253 words (5418291/s), 765516 word types, 6460000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6470000, processed 771427524 words (5395037/s), 766160 word types, 6470000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6480000, processed 772417515 words (5424684/s), 766671 word types, 6480000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6490000, processed 773409151 words (5446160/s), 767304 word types, 6490000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6500000, processed 774409330 words (5492210/s), 767938 word types, 6500000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6510000, processed 775412954 words (5463107/s), 768569 word types, 6510000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6520000, processed 776405055 words (5461691/s), 769088 word types, 6520000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6530000, processed 777397331 words (5329160/s), 769666 word types, 6530000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6540000, processed 778414812 words (5249703/s), 770280 word types, 6540000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6550000, processed 779436988 words (5382187/s), 770941 word types, 6550000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6560000, processed 780442930 words (5324499/s), 771574 word types, 6560000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6570000, processed 781445489 words (5342760/s), 772186 word types, 6570000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6580000, processed 782459745 words (5377046/s), 772793 word types, 6580000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6590000, processed 783469888 words (5456031/s), 773428 word types, 6590000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6600000, processed 784459349 words (5425956/s), 774021 word types, 6600000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6610000, processed 785478726 words (5457971/s), 774702 word types, 6610000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6620000, processed 786480967 words (5463314/s), 775334 word types, 6620000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6630000, processed 787494274 words (5417313/s), 775962 word types, 6630000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6640000, processed 788508937 words (5319904/s), 776544 word types, 6640000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6650000, processed 789515161 words (5454860/s), 777104 word types, 6650000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6660000, processed 790527851 words (5455099/s), 777726 word types, 6660000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6670000, processed 791522885 words (5376227/s), 778386 word types, 6670000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6680000, processed 792502982 words (5382303/s), 778924 word types, 6680000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6690000, processed 793500876 words (5368138/s), 779457 word types, 6690000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6700000, processed 794503952 words (5043006/s), 780079 word types, 6700000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6710000, processed 795497045 words (4946851/s), 780617 word types, 6710000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6720000, processed 796529793 words (5203627/s), 781170 word types, 6720000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6730000, processed 797543437 words (5423764/s), 781716 word types, 6730000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6740000, processed 798550296 words (5384985/s), 782251 word types, 6740000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6750000, processed 799555965 words (5419300/s), 782856 word types, 6750000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6760000, processed 800563768 words (5415204/s), 783437 word types, 6760000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6770000, processed 801575645 words (5224148/s), 783989 word types, 6770000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6780000, processed 802599290 words (5367223/s), 784582 word types, 6780000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6790000, processed 803605320 words (5307291/s), 785090 word types, 6790000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6800000, processed 804627238 words (5201631/s), 785659 word types, 6800000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6810000, processed 805638073 words (5232933/s), 786192 word types, 6810000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6820000, processed 806644402 words (5435492/s), 786751 word types, 6820000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6830000, processed 807636741 words (5345383/s), 787290 word types, 6830000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6840000, processed 808639076 words (5332593/s), 787797 word types, 6840000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6850000, processed 809653507 words (5362187/s), 788424 word types, 6850000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6860000, processed 810683562 words (5261223/s), 788976 word types, 6860000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6870000, processed 811714137 words (5207332/s), 789614 word types, 6870000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6880000, processed 812784565 words (5021532/s), 790299 word types, 6880000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6890000, processed 813832418 words (5146241/s), 790988 word types, 6890000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6900000, processed 814894243 words (5046310/s), 791627 word types, 6900000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6910000, processed 815954603 words (5263872/s), 792279 word types, 6910000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6920000, processed 817022865 words (5168799/s), 792885 word types, 6920000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6930000, processed 818068852 words (5088566/s), 793511 word types, 6930000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6940000, processed 819129145 words (5071436/s), 794060 word types, 6940000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6950000, processed 820170970 words (5234666/s), 794677 word types, 6950000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6960000, processed 821230509 words (5166634/s), 795277 word types, 6960000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6970000, processed 822303672 words (5095511/s), 795911 word types, 6970000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6980000, processed 823374263 words (5241489/s), 796558 word types, 6980000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #6990000, processed 824454932 words (5273970/s), 797190 word types, 6990000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7000000, processed 825510912 words (5257604/s), 797871 word types, 7000000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7010000, processed 826584470 words (5078160/s), 798452 word types, 7010000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7020000, processed 827646855 words (5277621/s), 799129 word types, 7020000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7030000, processed 828692421 words (5321586/s), 799725 word types, 7030000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7040000, processed 829765604 words (5348083/s), 800370 word types, 7040000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7050000, processed 830844545 words (5336730/s), 801082 word types, 7050000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7060000, processed 831919282 words (5365358/s), 801645 word types, 7060000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7070000, processed 832984400 words (5375578/s), 802242 word types, 7070000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7080000, processed 834044866 words (5267980/s), 802884 word types, 7080000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7090000, processed 835126308 words (5058398/s), 803597 word types, 7090000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7100000, processed 836206913 words (5258658/s), 804238 word types, 7100000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7110000, processed 837272767 words (5252640/s), 804832 word types, 7110000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7120000, processed 838336480 words (5396838/s), 805565 word types, 7120000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7130000, processed 839411902 words (5361271/s), 806215 word types, 7130000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7140000, processed 840483700 words (5361273/s), 806813 word types, 7140000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7150000, processed 841555269 words (5301241/s), 807479 word types, 7150000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7160000, processed 842622314 words (5441055/s), 808118 word types, 7160000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7170000, processed 843692457 words (5405476/s), 808700 word types, 7170000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7180000, processed 844763221 words (5334613/s), 809287 word types, 7180000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7190000, processed 845823658 words (5228497/s), 809892 word types, 7190000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7200000, processed 846904982 words (5380806/s), 810460 word types, 7200000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7210000, processed 847976939 words (5296192/s), 811019 word types, 7210000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7220000, processed 849037356 words (5396731/s), 811544 word types, 7220000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7230000, processed 850103915 words (5215062/s), 812134 word types, 7230000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7240000, processed 851119066 words (5211574/s), 812714 word types, 7240000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7250000, processed 852116802 words (5312448/s), 813249 word types, 7250000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7260000, processed 853133178 words (5310865/s), 813862 word types, 7260000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7270000, processed 854147382 words (5372392/s), 814371 word types, 7270000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7280000, processed 855143039 words (5267046/s), 814960 word types, 7280000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7290000, processed 856149708 words (5192510/s), 815509 word types, 7290000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7300000, processed 857153995 words (5327572/s), 816110 word types, 7300000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7310000, processed 858147838 words (5246043/s), 816671 word types, 7310000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7320000, processed 859143371 words (5302076/s), 817261 word types, 7320000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7330000, processed 860146912 words (5383303/s), 817877 word types, 7330000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7340000, processed 861159030 words (5288423/s), 818512 word types, 7340000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7350000, processed 862150002 words (5258194/s), 819037 word types, 7350000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7360000, processed 863143573 words (5347014/s), 819626 word types, 7360000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7370000, processed 864148016 words (5403081/s), 820234 word types, 7370000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7380000, processed 865141337 words (5373363/s), 820767 word types, 7380000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7390000, processed 866145899 words (5402122/s), 821311 word types, 7390000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7400000, processed 867157728 words (5337303/s), 821799 word types, 7400000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7410000, processed 868152744 words (5306967/s), 822406 word types, 7410000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7420000, processed 869151503 words (5221095/s), 822980 word types, 7420000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7430000, processed 870156027 words (5131210/s), 823538 word types, 7430000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7440000, processed 871162842 words (5415731/s), 824127 word types, 7440000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7450000, processed 872173779 words (5290842/s), 824593 word types, 7450000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7460000, processed 873174548 words (5329357/s), 825166 word types, 7460000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7470000, processed 874186100 words (5416195/s), 825640 word types, 7470000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7480000, processed 875189573 words (5357672/s), 826190 word types, 7480000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7490000, processed 876200410 words (5379373/s), 826768 word types, 7490000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7500000, processed 877192484 words (5379776/s), 827314 word types, 7500000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7510000, processed 878181478 words (5372351/s), 827801 word types, 7510000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7520000, processed 879191459 words (5310130/s), 828262 word types, 7520000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7530000, processed 880217227 words (5355563/s), 828765 word types, 7530000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7540000, processed 881218832 words (5344707/s), 829348 word types, 7540000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7550000, processed 882201910 words (5344664/s), 829860 word types, 7550000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7560000, processed 883201190 words (5324974/s), 830402 word types, 7560000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7570000, processed 884198601 words (5336860/s), 830988 word types, 7570000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7580000, processed 885190916 words (5303278/s), 831480 word types, 7580000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7590000, processed 886207648 words (5357444/s), 832004 word types, 7590000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7600000, processed 887217484 words (5304168/s), 832535 word types, 7600000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7610000, processed 888230663 words (5361735/s), 833066 word types, 7610000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7620000, processed 889238431 words (5387036/s), 833547 word types, 7620000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7630000, processed 890237167 words (5350514/s), 834043 word types, 7630000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7640000, processed 891250741 words (5284831/s), 834590 word types, 7640000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7650000, processed 892255564 words (5403187/s), 835128 word types, 7650000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7660000, processed 893266646 words (5330018/s), 835612 word types, 7660000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7670000, processed 894299591 words (5330138/s), 836218 word types, 7670000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7680000, processed 895339140 words (5274396/s), 836861 word types, 7680000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7690000, processed 896380656 words (5430671/s), 837344 word types, 7690000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7700000, processed 897428276 words (5378326/s), 837925 word types, 7700000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7710000, processed 898469112 words (5248249/s), 838453 word types, 7710000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7720000, processed 899525923 words (5092041/s), 839050 word types, 7720000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7730000, processed 900571916 words (5349010/s), 839668 word types, 7730000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7740000, processed 901614247 words (5112444/s), 840232 word types, 7740000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7750000, processed 902661093 words (5185491/s), 840888 word types, 7750000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7760000, processed 903713576 words (5219991/s), 841454 word types, 7760000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7770000, processed 904761659 words (5264914/s), 842000 word types, 7770000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7780000, processed 905811997 words (5298633/s), 842552 word types, 7780000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7790000, processed 906868026 words (5165708/s), 843164 word types, 7790000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7800000, processed 907915281 words (5396447/s), 843724 word types, 7800000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7810000, processed 908966364 words (5385446/s), 844239 word types, 7810000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7820000, processed 910031312 words (5417029/s), 844847 word types, 7820000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7830000, processed 911074638 words (5435761/s), 845404 word types, 7830000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7840000, processed 912121331 words (5476052/s), 845942 word types, 7840000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7850000, processed 913184645 words (5468921/s), 846505 word types, 7850000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7860000, processed 914247434 words (5479555/s), 847130 word types, 7860000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7870000, processed 915307924 words (5471168/s), 847738 word types, 7870000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7880000, processed 916378541 words (5511646/s), 848381 word types, 7880000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7890000, processed 917439898 words (5438744/s), 849053 word types, 7890000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7900000, processed 918519316 words (5477750/s), 849627 word types, 7900000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7910000, processed 919590952 words (5462641/s), 850181 word types, 7910000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7920000, processed 920644007 words (5445900/s), 850766 word types, 7920000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7930000, processed 921685916 words (5446151/s), 851391 word types, 7930000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7940000, processed 922747348 words (5455157/s), 851927 word types, 7940000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7950000, processed 923819602 words (5453819/s), 852480 word types, 7950000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7960000, processed 924891719 words (5212578/s), 853065 word types, 7960000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7970000, processed 925956687 words (5567106/s), 853633 word types, 7970000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7980000, processed 927000415 words (5146690/s), 854168 word types, 7980000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #7990000, processed 928078927 words (5330595/s), 854673 word types, 7990000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8000000, processed 929115019 words (5431323/s), 855212 word types, 8000000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8010000, processed 930183832 words (5120677/s), 855747 word types, 8010000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8020000, processed 931252231 words (5499062/s), 856256 word types, 8020000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8030000, processed 932314998 words (5323249/s), 856795 word types, 8030000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8040000, processed 933381737 words (5035984/s), 857258 word types, 8040000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8050000, processed 934463317 words (5369851/s), 857865 word types, 8050000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8060000, processed 935464988 words (5072689/s), 858465 word types, 8060000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8070000, processed 936459580 words (5250601/s), 859039 word types, 8070000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8080000, processed 937448353 words (5173738/s), 859591 word types, 8080000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8090000, processed 938453722 words (4939410/s), 860250 word types, 8090000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8100000, processed 939438419 words (4920575/s), 860804 word types, 8100000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8110000, processed 940434666 words (4907243/s), 861324 word types, 8110000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8120000, processed 941447932 words (4940523/s), 861855 word types, 8120000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8130000, processed 942458436 words (4967138/s), 862411 word types, 8130000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8140000, processed 943478113 words (4979213/s), 862977 word types, 8140000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8150000, processed 944489610 words (4944967/s), 863585 word types, 8150000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8160000, processed 945504816 words (4959214/s), 864159 word types, 8160000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8170000, processed 946514000 words (4948909/s), 864692 word types, 8170000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8180000, processed 947515977 words (4979802/s), 865211 word types, 8180000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8190000, processed 948526874 words (4954778/s), 865732 word types, 8190000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8200000, processed 949545761 words (5320515/s), 866293 word types, 8200000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8210000, processed 950556635 words (5000663/s), 866773 word types, 8210000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8220000, processed 951582493 words (5142655/s), 867390 word types, 8220000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8230000, processed 952593228 words (5391662/s), 867917 word types, 8230000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8240000, processed 953603712 words (5226432/s), 868499 word types, 8240000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8250000, processed 954598320 words (5262657/s), 868940 word types, 8250000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8260000, processed 955620900 words (5171666/s), 869479 word types, 8260000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8270000, processed 956638793 words (5211725/s), 870077 word types, 8270000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8280000, processed 957641729 words (5332046/s), 870590 word types, 8280000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8290000, processed 958638333 words (5175964/s), 871133 word types, 8290000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8300000, processed 959644176 words (5200956/s), 871675 word types, 8300000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8310000, processed 960649465 words (5244613/s), 872199 word types, 8310000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8320000, processed 961648327 words (5323954/s), 872745 word types, 8320000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8330000, processed 962655442 words (5258374/s), 873281 word types, 8330000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8340000, processed 963670789 words (5302002/s), 873889 word types, 8340000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8350000, processed 964660565 words (5288343/s), 874371 word types, 8350000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8360000, processed 965676452 words (5294006/s), 874912 word types, 8360000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8370000, processed 966691364 words (5229453/s), 875427 word types, 8370000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8380000, processed 967708791 words (5270529/s), 875925 word types, 8380000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8390000, processed 968726911 words (5415950/s), 876430 word types, 8390000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8400000, processed 969732973 words (5424788/s), 876990 word types, 8400000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8410000, processed 970738336 words (5331881/s), 877503 word types, 8410000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8420000, processed 971737762 words (5279362/s), 878034 word types, 8420000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8430000, processed 972737734 words (5337356/s), 878541 word types, 8430000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8440000, processed 973737068 words (5354249/s), 879068 word types, 8440000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8450000, processed 974758430 words (5370533/s), 879570 word types, 8450000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8460000, processed 975783634 words (5422894/s), 880054 word types, 8460000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8470000, processed 976832791 words (5452541/s), 880620 word types, 8470000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8480000, processed 977885781 words (5362420/s), 881210 word types, 8480000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8490000, processed 978962853 words (5390657/s), 881797 word types, 8490000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8500000, processed 979996289 words (5302498/s), 882305 word types, 8500000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8510000, processed 981061658 words (5395634/s), 882898 word types, 8510000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8520000, processed 982119296 words (5300704/s), 883422 word types, 8520000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8530000, processed 983176393 words (5402273/s), 883924 word types, 8530000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8540000, processed 984233172 words (5360205/s), 884506 word types, 8540000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8550000, processed 985282762 words (5456869/s), 885041 word types, 8550000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8560000, processed 986342571 words (5365202/s), 885591 word types, 8560000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8570000, processed 987400510 words (5375015/s), 886202 word types, 8570000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8580000, processed 988468339 words (5319794/s), 886778 word types, 8580000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8590000, processed 989513759 words (5302929/s), 887367 word types, 8590000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8600000, processed 990572270 words (5270856/s), 887921 word types, 8600000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8610000, processed 991660254 words (5236794/s), 888472 word types, 8610000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8620000, processed 992720782 words (5158917/s), 889067 word types, 8620000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8630000, processed 993792559 words (5321725/s), 889631 word types, 8630000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8640000, processed 994866890 words (5245632/s), 890207 word types, 8640000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8650000, processed 995933142 words (5236384/s), 890740 word types, 8650000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8660000, processed 996996896 words (5292507/s), 891291 word types, 8660000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8670000, processed 998069680 words (5162208/s), 891948 word types, 8670000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8680000, processed 999131430 words (5280315/s), 892642 word types, 8680000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8690000, processed 1000204456 words (5241900/s), 893230 word types, 8690000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8700000, processed 1001257596 words (5274117/s), 893785 word types, 8700000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8710000, processed 1002307639 words (5224401/s), 894316 word types, 8710000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8720000, processed 1003382891 words (5148460/s), 894950 word types, 8720000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8730000, processed 1004452916 words (5290062/s), 895509 word types, 8730000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8740000, processed 1005507936 words (5340660/s), 895956 word types, 8740000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8750000, processed 1006573514 words (5441266/s), 896502 word types, 8750000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8760000, processed 1007636423 words (5397144/s), 897040 word types, 8760000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8770000, processed 1008692280 words (5387456/s), 897525 word types, 8770000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8780000, processed 1009761643 words (5435394/s), 898092 word types, 8780000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8790000, processed 1010816163 words (5431515/s), 898677 word types, 8790000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8800000, processed 1011873386 words (5442392/s), 899205 word types, 8800000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8810000, processed 1012927939 words (5326755/s), 899746 word types, 8810000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8820000, processed 1013990003 words (5396151/s), 900392 word types, 8820000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8830000, processed 1015087386 words (5430044/s), 900902 word types, 8830000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8840000, processed 1016093069 words (5347295/s), 901437 word types, 8840000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8850000, processed 1017093798 words (5379917/s), 902051 word types, 8850000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8860000, processed 1018083139 words (5329834/s), 902598 word types, 8860000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8870000, processed 1019094258 words (5337255/s), 903121 word types, 8870000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8880000, processed 1020106852 words (5384870/s), 903645 word types, 8880000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8890000, processed 1021111044 words (5381243/s), 904129 word types, 8890000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8900000, processed 1022123961 words (5334674/s), 904704 word types, 8900000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8910000, processed 1023121165 words (5391200/s), 905261 word types, 8910000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8920000, processed 1024134264 words (5317637/s), 905751 word types, 8920000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8930000, processed 1025148122 words (5141274/s), 906244 word types, 8930000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8940000, processed 1026130935 words (5154760/s), 906770 word types, 8940000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8950000, processed 1027147952 words (5295651/s), 907301 word types, 8950000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8960000, processed 1028164201 words (5256183/s), 907814 word types, 8960000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8970000, processed 1029168492 words (5245737/s), 908351 word types, 8970000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8980000, processed 1030171424 words (5172151/s), 908899 word types, 8980000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #8990000, processed 1031174935 words (5140458/s), 909458 word types, 8990000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9000000, processed 1032174739 words (5142449/s), 909962 word types, 9000000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9010000, processed 1033178152 words (5148017/s), 910437 word types, 9010000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9020000, processed 1034199518 words (5189978/s), 910963 word types, 9020000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9030000, processed 1035228214 words (5191316/s), 911539 word types, 9030000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9040000, processed 1036228677 words (5272208/s), 912079 word types, 9040000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9050000, processed 1037256143 words (5329402/s), 912651 word types, 9050000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9060000, processed 1038280550 words (5327226/s), 913140 word types, 9060000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9070000, processed 1039285678 words (5239140/s), 913609 word types, 9070000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9080000, processed 1040291757 words (5313547/s), 914122 word types, 9080000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9090000, processed 1041299878 words (5261196/s), 914607 word types, 9090000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9100000, processed 1042307127 words (5278306/s), 915169 word types, 9100000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9110000, processed 1043315682 words (5240521/s), 915716 word types, 9110000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9120000, processed 1044337083 words (5316534/s), 916289 word types, 9120000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9130000, processed 1045344346 words (5395034/s), 916757 word types, 9130000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9140000, processed 1046366926 words (5299626/s), 917176 word types, 9140000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9150000, processed 1047363642 words (5242573/s), 917599 word types, 9150000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9160000, processed 1048375200 words (5338428/s), 918146 word types, 9160000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9170000, processed 1049390803 words (5154439/s), 918591 word types, 9170000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9180000, processed 1050400363 words (4745970/s), 919059 word types, 9180000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9190000, processed 1051409896 words (5168381/s), 919542 word types, 9190000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9200000, processed 1052425393 words (5300730/s), 920098 word types, 9200000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9210000, processed 1053435408 words (4912088/s), 920555 word types, 9210000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9220000, processed 1054428094 words (4806499/s), 921015 word types, 9220000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9230000, processed 1055445552 words (4872334/s), 921463 word types, 9230000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9240000, processed 1056467484 words (4763065/s), 921940 word types, 9240000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9250000, processed 1057508213 words (4907590/s), 922464 word types, 9250000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9260000, processed 1058555389 words (4694807/s), 923002 word types, 9260000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9270000, processed 1059592696 words (4727883/s), 923587 word types, 9270000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9280000, processed 1060628466 words (4708915/s), 924119 word types, 9280000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9290000, processed 1061672124 words (4830953/s), 924636 word types, 9290000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9300000, processed 1062711537 words (5195176/s), 925138 word types, 9300000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9310000, processed 1063755864 words (5179010/s), 925634 word types, 9310000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9320000, processed 1064804384 words (5131554/s), 926175 word types, 9320000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9330000, processed 1065879558 words (5226342/s), 926682 word types, 9330000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9340000, processed 1066918438 words (5101854/s), 927195 word types, 9340000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9350000, processed 1067971466 words (4998126/s), 927764 word types, 9350000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9360000, processed 1069025667 words (5134958/s), 928302 word types, 9360000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9370000, processed 1070074552 words (5107953/s), 928922 word types, 9370000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9380000, processed 1071107659 words (5167064/s), 929434 word types, 9380000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9390000, processed 1072143480 words (5184308/s), 929953 word types, 9390000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9400000, processed 1073186248 words (5184543/s), 930451 word types, 9400000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9410000, processed 1074240569 words (4833677/s), 930900 word types, 9410000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9420000, processed 1075278343 words (4835055/s), 931440 word types, 9420000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9430000, processed 1076338035 words (4395695/s), 932034 word types, 9430000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9440000, processed 1077404188 words (4687297/s), 932650 word types, 9440000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9450000, processed 1078472820 words (4698026/s), 933210 word types, 9450000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9460000, processed 1079523557 words (4316769/s), 933752 word types, 9460000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9470000, processed 1080608048 words (4757137/s), 934506 word types, 9470000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9480000, processed 1081663477 words (5116028/s), 935093 word types, 9480000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9490000, processed 1082720125 words (4214157/s), 935576 word types, 9490000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9500000, processed 1083795757 words (4440051/s), 936112 word types, 9500000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9510000, processed 1084871159 words (4441010/s), 936719 word types, 9510000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9520000, processed 1085939199 words (4546469/s), 937208 word types, 9520000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9530000, processed 1087009861 words (4835153/s), 937753 word types, 9530000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9540000, processed 1088084060 words (4830998/s), 938273 word types, 9540000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9550000, processed 1089151131 words (5020891/s), 938813 word types, 9550000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9560000, processed 1090221349 words (4239989/s), 939392 word types, 9560000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9570000, processed 1091306090 words (5055772/s), 939951 word types, 9570000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9580000, processed 1092369219 words (3955594/s), 940509 word types, 9580000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9590000, processed 1093450155 words (4704442/s), 940982 word types, 9590000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9600000, processed 1094517044 words (4624730/s), 941459 word types, 9600000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9610000, processed 1095596774 words (5149641/s), 941974 word types, 9610000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9620000, processed 1096649909 words (5451649/s), 942445 word types, 9620000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9630000, processed 1097730779 words (5350326/s), 942938 word types, 9630000 tags INFO:gensim.models.doc2vec:PROGRESS: at example #9640000, processed 1098833075 words (3801916/s), 943467 word types, 9640000 tags INFO:gensim.models.doc2vec:collected 943652 word types and 9643078 unique tags from a corpus of 9643078 examples and 1099181249 words INFO:gensim.models.word2vec:Loading a fresh vocabulary INFO:gensim.models.word2vec:effective_min_count=10 retains 153675 unique words (16% of original 943652, drops 789977) INFO:gensim.models.word2vec:effective_min_count=10 leaves 1097622005 word corpus (99% of original 1099181249, drops 1559244) INFO:gensim.models.word2vec:deleting the raw counts dictionary of 943652 items INFO:gensim.models.word2vec:sample=0.001 downsamples 55 most-common words INFO:gensim.models.word2vec:downsampling leaves estimated 834934251 word corpus (76.1% of prior 1097622005) INFO:gensim.models.base_any2vec:estimated required memory for 153675 words and 250 dimensions: 10027265500 bytes INFO:gensim.models.word2vec:resetting layer weights INFO:gensim.models.base_any2vec:training model with 5 workers on 153675 vocabulary and 250 features, using sg=0 hs=0 sample=0.001 negative=5 window=5