numpy
scipy
pandas
python-docx
openpyxl
beautifulsoup4
pillow
gensim
PyPDF2
nltk