TagProc
From John's Wiki
TagProc
A Tag Processing PHP application.
For my own convenience I developed an PHP application, which I named TagProc. This application does the following:
* Scan weblogs and podcasts I subscribed to (registered in the database) (RSS or ATOM) * Store new entries of the weblogs and podcasts in the database * Split the titles of the entries up in seperate words (tags), skipping irrelevant tags (also stored in the database). * The application counts the frequency of the occurrence of each of the tags, giving it a frequency-value. * Using the frequency-value: ** a TagCloud of the most frequent occurring tags can be shown. ** a sorted list of entries can be displayed (the seperate tags of the title of the entry, give the entry as a whole a value).
The application also has a (upto now) limited administration interface
* create database tables * add (subscribe) to a new weblog or podcast * disable/enable of delete a subscribed weblog or podcast
It has the following todo's:
* build in an authorisation mechanism * build a administration screen for irrelevant tags
Demonstration
- TagCloud: http://www.johnvandervuurst.nl/tagproc.php?operation=tagcloud [1]
- List : http://www.johnvandervuurst.nl/tagproc.php?operation=list [2]
- TagList : http://www.johnvandervuurst.nl/tagproc.php?operation=list_tags [3]
- Subscriptions: http://www.johnvandervuurst.nl/tagproc.php?operation=list_podcast [4]
Download (beta-version)
Requirements: *Webserver (own / ISP or hosting) with XAMPP installed (Apache, MySQL, PHP and PERL)
Dependencies: *WordCloud php class (for actually displaying the tagcloud), written by Derek Harvey.
download here... tagproc.tar.gz
Download contains:
- tagproc.php --> the application itself
- dbcfg.php --> file with database configuration (see fileheader itself for details)
- classes/wordcloud.class.php --> the wordcloud class of Derek Harvey, included under GPL license.
Installation:
- unpack tarfile (tar -xvzf tagproc.tar.gz) and place it in the root of your webserver.
- subscribe to weblogs and podcasts using operation add_podcast
- make sure to regulary (using cron, at of a scheduled task in windows) to call the refresh and refresh_tagcloud operations (for scanning weblogs and refreshing tag statistics)
- add a call to the tagcloud operation in your html page. For instance in an IFRAME, or using an include or directly from php
Good luck...!
John van der Vuurst
BlogMarks
del.icio.us
digg
Fark
Furl
Newsvine
reddit
Segnalo
Simpy
Slashdot
smarking
Spurl
Wists