forked from wuhaifengdhu/MonsterCrawler
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathvector.dat
2734 lines (2734 loc) · 31.3 KB
/
vector.dat
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
(lp1
(lp2
V(15, 20]
p3
aV(5, 10]
p4
aV(0, 2]
p5
aV(2, 5]
p6
aV(20, 30)
p7
aS'[0]'
p8
aV(10, 15]
p9
aa(lp10
VBachelor
p11
aVMaster
p12
aVDoctor
p13
aa(lp14
VLinguistics
p15
aVStatistics
p16
aVPsychology
p17
aVSociology
p18
aVAstrophysics
p19
aVMathematics
p20
aVBusiness
p21
aVInformation Science
p22
aVComputer Science
p23
aVData Science
p24
aVEconomics
p25
aVCognitive Science
p26
aVChemical Engineering
p27
aVBiological Engineering
p28
aVPhysics
p29
aVManagement Science and Engineering
p30
aVTransportation Engineering
p31
aVIndustrial Engineering
p32
aVElectrical and Electronics Engineering
p33
aVEngineering
p34
aVEnvironmental Engineering
p35
aVOperations Research
p36
aVGeography
p37
aa(lp38
S'differential equations'
p39
aS'edc'
p40
aS'qualitative research'
p41
aS'energy efficiency'
p42
aS'chinese'
p43
aS'apache storm'
p44
aS'hedge funds'
p45
aS'redis'
p46
aS'chaid'
p47
aS'computer forensics'
p48
aS'bioconductor'
p49
aS'ggplot2'
p50
aS'dsp'
p51
aS'system design'
p52
aS'go'
p53
aS'user experience design'
p54
aS'equity research'
p55
aS'microsoft sql server'
p56
aS'content management'
p57
aS'technical leadership'
p58
aS'game design'
p59
aS'cognitive neuroscience'
p60
aS'flask'
p61
aS'digital imaging'
p62
aS'fuzzy logic'
p63
aS'scientific visualization'
p64
aS'behavioral targeting'
p65
aS'matlab'
p66
aS'agile methodologies'
p67
aS'master data management'
p68
aS'drug discovery'
p69
aS'junit'
p70
aS'data entry'
p71
aS'materials science'
p72
aS'computational mathematics'
p73
aS'semiconductor industry'
p74
aS'sap bi'
p75
aS'cloudera impala'
p76
aS'automotive'
p77
aS'etl tools'
p78
aS'health informatics'
p79
aS'computer hardware'
p80
aS'decision trees'
p81
aS'facebook'
p82
aS'nanomaterials'
p83
aS'metabolomics'
p84
aS'erp'
p85
aS'infectious diseases'
p86
aS'voip'
p87
aS'event management'
p88
aS'grid computing'
p89
aS'employee engagement'
p90
aS'risk assessment'
p91
aS'solver'
p92
aS'lisp'
p93
aS'mercurial'
p94
aS'risk modeling'
p95
aS'mapreduce'
p96
aS'hive'
p97
aS'engineering management'
p98
aS'civil engineering'
p99
aS'olap'
p100
aS'financial services'
p101
aS'asp'
p102
aS'real estate'
p103
aS'penetration testing'
p104
aS'cms'
p105
aS'functional programming'
p106
aS'predictive analytics'
p107
aS'logistics'
p108
aS'numerical analysis'
p109
aS'air force'
p110
aS'pmp'
p111
aS'unsupervised learning'
p112
aS'global health'
p113
aS'network science'
p114
aS'computational neuroscience'
p115
aS'teradata'
p116
aS'cvs'
p117
aS'business insights'
p118
aS'meta-analysis'
p119
aS'process improvement'
p120
aS'data-driven decision making'
p121
aS'information design'
p122
aS'it management'
p123
aS'batch processing'
p124
aS'pmo'
p125
aS'literature reviews'
p126
aVpanda
p127
aS'vmware'
p128
aS'job analysis'
p129
aS'solution selling'
p130
aS'evolutionary biology'
p131
aS'statistical consulting'
p132
aS'network optimization'
p133
aS'datastage'
p134
aS'clinical development'
p135
aS'microbiology'
p136
aS'time management'
p137
aS'access'
p138
aS'cancer research'
p139
aS'ssis'
p140
aS'large-scale data analysis'
p141
aS'esri'
p142
aS'mac os'
p143
aS'professional services'
p144
aS'awk'
p145
aS'go-to-market strategy'
p146
aS'segmentation'
p147
aS'team leadership'
p148
aS'transportation'
p149
aS'environmental science'
p150
aS'stochastic processes'
p151
aS'random forests'
p152
aS'matplotlib'
p153
aS'customer analysis'
p154
aS'water'
p155
aS'crystal reports'
p156
aS'sql db2'
p157
aS'21 cfr part 11'
p158
aS'statistical modeling'
p159
aS'ubuntu'
p160
aS'apache'
p161
aS'openmp'
p162
aS'science communication'
p163
aS'b2b'
p164
aS'information technology'
p165
aS'business process management'
p166
aS'software development'
p167
aS'medical imaging'
p168
aS'search'
p169
aS'glm'
p170
aS'final cut pro'
p171
aS'data cleaning'
p172
aS'root'
p173
aS'consumer insights'
p174
aS'economics'
p175
aS'test automation'
p176
aS'multivariate statistics'
p177
aS'credit'
p178
aS'rdf'
p179
aS'product launch'
p180
aS'advertising'
p181
aS'windows azure'
p182
aS'military'
p183
aS'microsoft access'
p184
aS'personalized medicine'
p185
aS'options'
p186
aS'patents'
p187
aS'qualtrics'
p188
aS'computer architecture'
p189
aS'classification'
p190
aS'cdma'
p191
aS'probabilistic models'
p192
aS'clinical study design'
p193
aS'adobe creative suite'
p194
aS'healthcare analytics'
p195
aS'test driven development'
p196
aS'graphic design'
p197
aS'vis'
p198
aS'unix'
p199
aS'healthcare information technology'
p200
aS'account management'
p201
aS'mobile advertising'
p202
aS'web scraping'
p203
aS'outsourcing'
p204
aS'experimentation'
p205
aS'social media'
p206
aS'conjoint analysis'
p207
aS'web analytics'
p208
aS'open source'
p209
aS'army'
p210
aS'j2ee'
p211
aS'microeconomics'
p212
aS'petroleum'
p213
aS'devops'
p214
aS'governance'
p215
aS'perl'
p216
aS'crowdsourcing'
p217
aS'risk management'
p218
aS'campaign management'
p219
aS'visio'
p220
aS'bonds'
p221
aS'erwin'
p222
aS'gsm'
p223
aS'data management'
p224
aS'quality assurance'
p225
aS'structural equation modeling'
p226
aS'performance management'
p227
aS'web design'
p228
aS'rapidminer'
p229
aS'lean manufacturing'
p230
aS'jmp'
p231
aS'hl7'
p232
aS'cplex'
p233
aS'business process'
p234
aS'process automation'
p235
aS'investment strategies'
p236
aS'computational linguistics'
p237
aS'data processing'
p238
aS'teaching'
p239
aS'html 5'
p240
aS'project management'
p241
aS'grant writing'
p242
aS'database administration'
p243
aS'scientific analysis'
p244
aS'genetic algorithms'
p245
aS'word'
p246
aS'science'
p247
aS'applied mathematics'
p248
aS'eclipse'
p249
aS'web content'
p250
aS'machine learning'
p251
aS'psychometrics'
p252
aS'hidden markov models'
p253
aS'mobile applications'
p254
aS'budgets'
p255
aS'customer acquisition'
p256
aS'visual basic'
p257
aS'ms office suite'
p258
aS'cluster analysis'
p259
aS'scripting'
p260
aS'cad'
p261
aS'speech recognition'
p262
aS'parallel computing'
p263
aS'educational technology'
p264
aS'portfolio management'
p265
aS'semantic web'
p266
aS'cognitive science'
p267
aS'financial risk management'
p268
aS'revenue analysis'
p269
aS'display advertising'
p270
aS'portfolio optimization'
p271
aS'simulation'
p272
aS'random forest'
p273
aS'business modeling'
p274
aS'simulink'
p275
aS'sip'
p276
aS'cryptography'
p277
aS'contract negotiation'
p278
aS'power bi'
p279
aS'sharepoint'
p280
aS'sas enterprise guide'
p281
aS'tableau'
p282
aS'application architecture'
p283
aS'market research'
p284
aS'gcp'
p285
aS'computer vision'
p286
aS'business planning'
p287
aS'ehr'
p288
aS'text classification'
p289
aS'product innovation'
p290
aS'amazon s3'
p291
aS'hypothesis testing'
p292
aS'unstructured data'
p293
aS'css'
p294
aS'3d visualization'
p295
aS'object recognition'
p296
aS'computational biology'
p297
aS'algebra'
p298
aS'philosophy'
p299
aS'leadership development'
p300
aS'multivariate testing'
p301
aS'entrepreneurship'
p302
aS'sustainability'
p303
aS'theano'
p304
aS'political science'
p305
aS'data wrangling'
p306
aS'arduino'
p307
aS'bootstrapping'
p308
aS'extract transform load etl'
p309
aS'mpi'
p310
aS'data'
p311
aS'metrology'
p312
aS'a'
aS'technical training'
p313
aS'github'
p314
aS'sparql'
p315
aS'commodity'
p316
aS'remote sensing'
p317
aS'object oriented design'
p318
aS'market making'
p319
aS'documentation'
p320
aS'derivatives'
p321
aS'django'
p322
aS'google analytics'
p323
aS'causal inference'
p324
aS'sharepoint designer'
p325
aS'algorithms'
p326
aS'healthcare'
p327
aS'tutoring'
p328
aS'nltk'
p329
aS'aerospace engineering'
p330
aS'jira'
p331
aS'photoshop'
p332
aS'oncology'
p333
aS'b2b marketing'
p334
aS'health economics'
p335
aS'sql server reporting services ssrs'
p336
aS'crystal ball'
p337
aS'intellectual property'
p338
aS'data warehouse architecture'
p339
aS'data acquisition'
p340
aS'strategic leadership'
p341
aS'radar'
p342
aS'quality control'
p343
aS'marketing research'
p344
aS'hbase'
p345
aS'multi-channel marketing'
p346
aS'fundraising'
p347
aS'rnaseq'
p348
aS'linguistics'
p349
aS'db2'
p350
aS'capital markets'
p351
aS'google docs'
p352
aS'anomaly detection'
p353
aS'pcr'
p354
aS'process engineering'
p355
aS'bloomberg terminal'
p356
aS'product marketing'
p357
aS'monte carlo simulation'
p358
aS'writing'
p359
aS'supply chain management'
p360
aS'digital media'
p361
aS'coaching'
p362
aS'spring framework'
p363
aS'statistical software'
p364
aS'wordpress'
p365
aS'microstrategy'
p366
aS'reverse engineering'
p367
aS'policy'
p368
aS'linux system administration'
p369
aS'pca'
p370
aS'soa'
p371
aS'operational planning'
p372
aS'finance'
p373
aS'customer relationship management crm'
p374
aS'python'
p375
aS'semantic analysis'
p376
aS'private equity'
p377
aS'postgis'
p378
aS'due diligence'
p379
aS'survey design'
p380
aS'scipy'
p381
aS'combinatorial optimization'
p382
aS'algorithm development'
p383
aS'technical documentation'
p384
aS'business development'
p385
aS'business transformation'
p386
aS'servers'
p387
aS'analytical skills'
p388
aS'cloudera'
p389
aVbayesian method
p390
aS'outcomes research'
p391
aS'spatial analysis'
p392
aS'innovation management'
p393
aS'svm'
p394
aS'.net'
p395
aS'communication skills'
p396
aS'user acceptance testing'
p397
aS'pharmaceutical industry'
p398
aS'program evaluation'
p399
aS'regression models'
p400
aS'association rules'
p401
aS'root cause analysis'
p402
aS'logistic regression'
p403
aS'drug development'
p404
aS'soc'
p405
aS'crm'
p406
aS'data analytics'
p407
aS'sap bw'
p408
aS'cartography'
p409
aS'knowledge management'
p410
aS'energy'
p411
aS'decision support'
p412
aS'computational chemistry'
p413
aS'information extraction'
p414
aS'sas base'
p415
aS'enterprise software'
p416
aS'trading'
p417
aS'ec2'
p418
aS'personalization'
p419
aS'relational databases'
p420
aS'firmware'
p421
aS'direct marketing'
p422
aS'insurance'
p423
aS'collaborative problem solving'
p424
aS'requirements management'
p425
aS'psychology'
p426
aS'jdbc'
p427
aS'image processing'
p428
aS'ssas'
p429
aS'questionnaire design'
p430
aS'web 2.0'
p431
aS'scientific computing'
p432
aS'research design'
p433
aS'research'
p434
aS'mahout'
p435
aS'urban planning'
p436
aS'vendor management'
p437
aS'survey methodology'
p438
aS'database marketing'
p439
aS'sdlc'
p440
aS'business requirements'
p441
aS'medicine'
p442
aS'electronics'
p443
aS'it service management'
p444
aS'netbeans'
p445
aS'statistical tools'
p446
aS'quickbooks'
p447
aS'molecular modeling'
p448
aS'factor analysis'
p449
aS'hibernate'
p450
aS'embedded software'
p451
aS'theory'
p452
aS'transportation engineering'
p453
aS'client presentation'
p454
aS'cissp'
p455
aS'data scientist'
p456
aS'cart'
p457
aS'data quality'
p458
aS'business analytics'
p459
aS'ajax'
p460
aS'statistics'
p461
aS'algorithm design'
p462
aS'foreign policy'
p463
aS'gis'
p464
aS'sensors'
p465
aS'mathematical analysis'
p466
aS'photography'
p467
aS'bash'
p468
aS'training'
p469
aS'knime'
p470
aS'open source software'
p471
aS'partial differential equations'
p472
aS'drupal'
p473
aS'optics'
p474
aS'sustainable development'
p475
aS'programming'
p476
aS'mobile devices'
p477
aS'business objects'
p478
aS'traffic engineering'
p479
aS'mvc'
p480
aS'elasticsearch'
p481
aS'opencv'
p482
aS'lte'
p483
aS'data security'
p484
aS'requirements analysis'
p485
aS'field work'
p486
aS'retail'
p487
aS'rapid prototyping'
p488
aS'microsoft azure'
p489
aS'scalding'
p490
aS'customer insight'
p491
aS'probability'
p492
aS'indesign'
p493
aS'proteomics'
p494
aS'bioinformatics'
p495
aS'demand forecasting'
p496
aS'proofreading'
p497
aS'ecology'
p498
aS'marketing strategy'
p499
aS'spanish'
p500
aS'mathematical modeling'
p501
aS'neo4j'
p502