Hi,

We have built a procedure to extract data from Wikipedia for a chatbot, using word2vec similarities.

- Cleaning and data preparation are an important part of the work.
- Using word2vec is a must, or possibly doc2vec or paragraph2vec.
- Able to tokenise sentences
- Able to detect entities and pronouns
- Able to manipula
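
To give an idea of the kind of similarity matching meant here, below is a minimal Python sketch. It is not the actual procedure, only an illustration under stated assumptions: the toy 3-dimensional vectors in `TOY_VECTORS` stand in for trained word2vec embeddings, the sentence splitter is a naive regex, and all names (`TOY_VECTORS`, `split_sentences`, `sentence_vector`, `best_match`) are hypothetical.

```python
import math
import re

# ASSUMPTION: toy 3-dimensional vectors standing in for trained word2vec
# embeddings. A real setup would load vectors from a trained model.
TOY_VECTORS = {
    "paris": [0.9, 0.1, 0.0],
    "capital": [0.8, 0.2, 0.1],
    "france": [0.85, 0.15, 0.05],
    "python": [0.0, 0.9, 0.3],
    "language": [0.1, 0.8, 0.4],
    "programming": [0.05, 0.85, 0.35],
}


def split_sentences(text):
    """Naive sentence tokeniser: split after '.', '!' or '?' plus whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase the sentence and keep alphanumeric word tokens only."""
    return re.findall(r"[a-z0-9]+", sentence.lower())


def sentence_vector(sentence):
    """Average the vectors of the known words; None if no word is known."""
    vecs = [TOY_VECTORS[w] for w in tokenize(sentence) if w in TOY_VECTORS]
    if not vecs:
        return None
    return [sum(col) / len(vecs) for col in zip(*vecs)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def best_match(question, text):
    """Return the sentence of `text` most similar to `question`, or None."""
    q_vec = sentence_vector(question)
    if q_vec is None:
        return None
    scored = []
    for sentence in split_sentences(text):
        s_vec = sentence_vector(sentence)
        if s_vec is not None:
            scored.append((cosine(q_vec, s_vec), sentence))
    if not scored:
        return None
    return max(scored, key=lambda pair: pair[0])[1]


if __name__ == "__main__":
    article = "Paris is the capital of France. Python is a programming language."
    print(best_match("What is the capital of France?", article))
```

Averaging word vectors is only one simple way to compare sentences; doc2vec or paragraph vectors would replace `sentence_vector` with a learned document embedding.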