Get in Touch

Course Outline

  1. Distributed processing in big data
    1. Data mining methods (training a single model + distributed prediction: traditional machine learning algorithms + MapReduce distributed prediction)
    2. Apache Spark MLlib
  2. Recommendations and targeted ad delivery:
    1. Aspects of natural language processing
    2. Text clustering, text classification (labeling), synonym detection
    3. User profile restoration, tagging systems
    4. Strategies for recommendation algorithms
    5. Lift between classes, lift within classes, how to achieve precision
    6. How to build a closed loop for recommendation algorithms
  3. Logistic regression, RankingSVM
  4. Feature recognition: (Automatic feature recognition with deep learning and graphs)
  5. Natural Language Processing
    1. Chinese word segmentation
    2. Topic modeling (text clustering)
    3. Text classification
    4. Keyword extraction
    5. Semantic analysis: semantic parser, word2vec to word vectors
    6. RNN Long short-term memory (LSTM) architecture
 21 Hours

Number of participants


Price per participant

Testimonials (1)

Upcoming Courses

Related Categories