NewsPersonalizationSystem

  1. News Personalization System
    1. Initial Contributors
    2. Download
  2. Algorithm Overview
  3. NPS Architecture
  4. Clustering Algorithms
    1. User clustering - MinHash
      1. MinHash
    2. Clustering - PLSI Algorithm
    3. Covisitation count
  5. References


News Personalization System

Google's News Personalization System Clone Project.

Initial Contributors

Download

Algorithm Overview


NPS Architecture

                                             +----------------------------+
                                             |  Mail Frontend Web server  |
                                             +-----+---------------+------+
                                                   |       ↑       |
                                                   ACP communication
                                                   ↓       |       ↓
                          +--------------------------------+--------------+
                          |                    intermediation DB Servers  +---+
                          +----------------------------------------+------+   |
                              ↑                    ↑               |          |
                     Reads user profile       Read contents   Update contents   |
                              |                    |               ↓          |
    +-------------------------+-----+    +---------+----------------------+   |
    |       Hbase:UserTable         |    |        Hbase:MIMETable         |   |
    |     (user profile, logs)      |    |          (MIME data)           |   |
    +-------------------------------+    +--------------------------------+   |
                              ↑                   ↑                         |
+-----------------------------+--------------------+----------------------+   |
|  Hadoop: MapReduce Jobs (e.g. Log Analysis, Network Analysis, .., etc)  |←--+
+-------------------------------------------------------------------------+

Clustering Algorithms

User clustering - MinHash

MinHash

Clustering - PLSI Algorithm

Covisitation count


References

last edited 2008-12-04 04:45:15 by udanax