项目作者: mdv3101

项目描述 :
A TF-IDF (Term Frequency & Inverse Document Frequency) based search algorithm for searching a small subset of Wikipedia Data using Apache Spark Cluster of 3 Nodes on top of HDFS, hosted on AWS, having web UI with Django.
高级语言: Python
项目地址: git://github.com/mdv3101/Wikipedia_Search_Engine.git
创建时间: 2017-04-10T15:59:26Z
项目社区:https://github.com/mdv3101/Wikipedia_Search_Engine

开源协议:

下载