News Personalization System

Google's News Personalization System Clone Project.

Initial Contributors

Download

Algorithm Overview


NPS Architecture

                                             +----------------------------+
                                             |  Mail Frontend Web server  |
                                             +-----+---------------+------+
                                                   |       ↑       |
                                                   ACP communication
                                                   ↓       |       ↓
                          +--------------------------------+--------------+
                          |                    intermediation DB Servers  +---+
                          +----------------------------------------+------+   |
                              ↑                    ↑               |          |
                     Reads user profile       Read contents   Update contents   |
                              |                    |               ↓          |
    +-------------------------+-----+    +---------+----------------------+   |
    |       Hbase:UserTable         |    |        Hbase:MIMETable         |   |
    |     (user profile, logs)      |    |          (MIME data)           |   |
    +-------------------------------+    +--------------------------------+   |
                              ↑                   ↑                         |
+-----------------------------+--------------------+----------------------+   |
|  Hadoop: MapReduce Jobs (e.g. Log Analysis, Network Analysis, .., etc)  |←--+
+-------------------------------------------------------------------------+


Clustering Algorithms

User clustering - MinHash

MinHash

Clustering - PLSI Algorithm

Covisitation count


References

NewsPersonalizationSystem (last edited 2009-09-20 23:54:14 by localhost)