TagProc - John's Wiki

A Tag Processing PHP application.


For my own convenience I developed a PHP application, which I named TagProc. This application does the following:

  * Scans the weblogs and podcasts I subscribe to (RSS or Atom feeds, registered in the database).
  * Stores new entries from those weblogs and podcasts in the database.
  * Splits the title of each entry into separate words (tags), skipping irrelevant tags (which are also stored in the database).
  * Counts how often each tag occurs, giving it a frequency value.
  * Uses the frequency value to:
    * show a TagCloud of the most frequently occurring tags;
    * display a sorted list of entries (the separate tags in an entry's title together give the entry as a whole a value).
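
The tag-processing steps above can be sketched in a few lines of PHP. This is only a minimal illustration, not the code from tagproc.php: the function names (titleToTags, tagFrequencies, entryScore), the in-memory arrays standing in for the database tables, and the rule that an entry's value is the sum of the frequencies of its tags are all assumptions made for the example.

```php
<?php
// Split a title into lower-case tags and drop the irrelevant ones.
// strtolower() only lower-cases ASCII letters; that is enough for a sketch.
function titleToTags(string $title, array $irrelevant): array
{
    $words = preg_split('/[^\p{L}\p{N}]+/u', strtolower($title), -1, PREG_SPLIT_NO_EMPTY);
    return array_values(array_diff($words, $irrelevant));
}

// Count how often each tag occurs across all titles (most frequent first).
function tagFrequencies(array $titles, array $irrelevant): array
{
    $freq = [];
    foreach ($titles as $title) {
        foreach (titleToTags($title, $irrelevant) as $tag) {
            $freq[$tag] = ($freq[$tag] ?? 0) + 1;
        }
    }
    arsort($freq);
    return $freq;
}

// Assumed scoring rule: an entry is worth the sum of the frequencies
// of the distinct tags in its title.
function entryScore(string $title, array $freq, array $irrelevant): int
{
    $score = 0;
    foreach (array_unique(titleToTags($title, $irrelevant)) as $tag) {
        $score += $freq[$tag] ?? 0;
    }
    return $score;
}

// Example data standing in for the database contents.
$titles     = ['PHP and MySQL tips', 'New PHP release', 'The MySQL podcast'];
$irrelevant = ['and', 'the', 'new'];

$freq = tagFrequencies($titles, $irrelevant);

// Sort the entries so that the highest-valued title comes first.
usort($titles, function (string $a, string $b) use ($freq, $irrelevant): int {
    return entryScore($b, $freq, $irrelevant) <=> entryScore($a, $freq, $irrelevant);
});
```

The $freq array is what a tag cloud would be drawn from, and the sorted $titles array corresponds to the sorted list of entries.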

The application also has an administration interface, which is limited so far:

  * create database tables
  * add (subscribe to) a new weblog or podcast
  * disable/enable or delete a subscribed weblog or podcast

It has the following to-dos:

  * build in an authorisation mechanism
  * build an administration screen for irrelevant tags


Demonstration

Download (beta-version)

Requirements: a web server (your own, or at an ISP or hosting provider) with XAMPP installed (Apache, MySQL, PHP and Perl).

Dependencies: the WordCloud PHP class (used to actually display the tag cloud), written by Derek Harvey.

Download here: tagproc.tar.gz

Download contains:

  • tagproc.php --> the application itself
  • dbcfg.php --> file with the database configuration (see the file header for details)
  • classes/wordcloud.class.php --> the WordCloud class by Derek Harvey, included under the GPL license.

Installation:

  • unpack the tar file (tar -xvzf tagproc.tar.gz) and place it in the root of your web server.
  • subscribe to weblogs and podcasts using the add_podcast operation
  • make sure to call the refresh and refresh_tagcloud operations regularly (using cron, at, or a scheduled task in Windows); they scan the weblogs and refresh the tag statistics
  • add a call to the tagcloud operation in your HTML page, for instance in an IFRAME, using an include, or directly from PHP
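
As an illustration of the last two steps, the sketch below shows one way a small PHP helper script could trigger the refresh operations from a scheduled job and produce the IFRAME markup for the tag cloud. How tagproc.php actually receives the operation name is not described on this page, so the query parameter operation, the helper names tagprocUrl and tagcloudIframe, the --refresh switch, and the URL http://localhost/tagproc.php are all assumptions; check tagproc.php itself for the real calling convention.

```php
<?php
// Hypothetical helper: builds the URL for one TagProc operation.
// The parameter name "operation" is an assumption, not taken from tagproc.php.
function tagprocUrl(string $baseUrl, string $operation): string
{
    return $baseUrl . '?' . http_build_query(['operation' => $operation]);
}

// Returns IFRAME markup that embeds the tag cloud in an HTML page.
function tagcloudIframe(string $baseUrl): string
{
    $src = htmlspecialchars(tagprocUrl($baseUrl, 'tagcloud'), ENT_QUOTES);
    return '<iframe src="' . $src . '"></iframe>';
}

// Assumed location of the application; replace with your own site.
$base = 'http://localhost/tagproc.php';

// When run from the command line with --refresh (for example from cron),
// call both refresh operations in the order given in the installation steps.
if (PHP_SAPI === 'cli' && in_array('--refresh', $argv ?? [], true)) {
    foreach (['refresh', 'refresh_tagcloud'] as $operation) {
        $result = @file_get_contents(tagprocUrl($base, $operation));
        echo $operation, ': ', ($result === false ? 'failed' : 'ok'), PHP_EOL;
    }
}
```

A scheduled job would then run this script with the --refresh switch at whatever interval suits the feeds, while a page template could simply echo tagcloudIframe($base).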

Good luck...!


John van der Vuurst
