Archive for November 21, 2006

Converting data to tab separated value format (TSV)

Techniques for converting data from a variety of sources (e.g. Excel, MS-Access, ODBC databases, text files) into the industry-standard tab-separated value format. Once converted, the data can be loaded into databases such as SQLite and MySQL, and it is also in the proper format for analysis with software packages such as EZ-R Stats for Windows. View PDF
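
As a rough illustration of one such conversion, the sketch below turns a single comma-separated record into a tab-separated one. It is not taken from the linked document. The function names splitCsvLine and toTsvLine are hypothetical, and the sketch assumes the input uses double-quoted fields with doubled quotes as escapes and has no line breaks inside a field.

```cpp
#include <string>
#include <vector>

// Split one CSV record into fields, honouring double-quoted fields and
// doubled quotes ("") inside them. Line breaks inside a field are not handled.
std::vector<std::string> splitCsvLine(const std::string& line) {
    std::vector<std::string> fields;
    std::string current;
    bool inQuotes = false;
    for (std::size_t i = 0; i < line.size(); ++i) {
        char c = line[i];
        if (inQuotes) {
            if (c == '"') {
                if (i + 1 < line.size() && line[i + 1] == '"') {
                    current += '"';    // doubled quote: keep one literal quote
                    ++i;
                } else {
                    inQuotes = false;  // closing quote
                }
            } else {
                current += c;
            }
        } else if (c == '"') {
            inQuotes = true;           // opening quote
        } else if (c == ',') {
            fields.push_back(current); // field separator
            current.clear();
        } else {
            current += c;
        }
    }
    fields.push_back(current);         // last field
    return fields;
}

// Join fields with tabs. A tab inside a field would corrupt the TSV layout,
// so it is replaced with a space (an assumption of this sketch).
std::string toTsvLine(const std::vector<std::string>& fields) {
    std::string out;
    for (std::size_t i = 0; i < fields.size(); ++i) {
        if (i > 0) out += '\t';
        for (char c : fields[i]) out += (c == '\t') ? ' ' : c;
    }
    return out;
}
```

Calling toTsvLine(splitCsvLine(line)) on each input line would give one TSV line per record. A comma inside a quoted field is kept as data rather than treated as a separator.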

Commonly used Internal Audit Software Functions

Documentation showing audit techniques for the commonly used Internal Audit software functions. View document.

Auditors Guide to Tests Using Benford’s Law

Documentation showing audit techniques for the application of Benford’s Law.
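
For readers unfamiliar with the test, Benford's Law says the leading digit d of many naturally occurring amounts appears with probability log10(1 + 1/d). The following is a minimal sketch of a first-digit test and is not taken from the linked guide. All function names are hypothetical, and leadingDigit relies on repeated floating-point scaling, which is an assumption that may misclassify values sitting extremely close to a power of ten.

```cpp
#include <array>
#include <cmath>
#include <vector>

// Expected proportion of values whose leading digit is d (1..9).
double benfordExpected(int d) {
    return std::log10(1.0 + 1.0 / d);
}

// Leading nonzero digit of |x|, or 0 when x is zero or not finite.
int leadingDigit(double x) {
    x = std::fabs(x);
    if (x == 0.0 || !std::isfinite(x)) return 0;
    while (x >= 10.0) x /= 10.0;  // scale large values down into [1, 10)
    while (x < 1.0) x *= 10.0;    // scale small values up into [1, 10)
    return static_cast<int>(x);
}

// Tally leading digits; counts[d] holds the tally for digit d (index 0 unused).
std::array<int, 10> countLeadingDigits(const std::vector<double>& values) {
    std::array<int, 10> counts{};
    for (double v : values) {
        int d = leadingDigit(v);
        if (d > 0) ++counts[d];
    }
    return counts;
}

// Chi-square statistic of the observed counts against Benford expectations.
double benfordChiSquare(const std::array<int, 10>& counts) {
    int n = 0;
    for (int d = 1; d <= 9; ++d) n += counts[d];
    if (n == 0) return 0.0;
    double chi2 = 0.0;
    for (int d = 1; d <= 9; ++d) {
        double expected = n * benfordExpected(d);
        double diff = counts[d] - expected;
        chi2 += diff * diff / expected;
    }
    return chi2;
}
```

With nine digit classes the statistic has eight degrees of freedom, so a value well above roughly 15.5 (the 5% critical value) suggests the data departs from the expected pattern and may merit a closer look.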

Auditors Guide to Duplicate Payments

Documentation showing techniques used to identify duplicate payments. View document.
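
One common approach, sketched below under stated assumptions, is to group payments on a matching key and report every group with more than one member. This is an illustration and is not the method from the linked guide. The Payment structure, the choice of vendor, invoice number and amount as the key, and the function name findDuplicatePayments are all hypothetical.

```cpp
#include <map>
#include <string>
#include <tuple>
#include <vector>

// Hypothetical payment record; amounts are held in integer cents so that
// equality comparison is exact.
struct Payment {
    std::string vendor;
    std::string invoice;
    long long amountCents;
};

using PaymentKey = std::tuple<std::string, std::string, long long>;

// Group payments by (vendor, invoice, amount) and return only the groups
// with two or more members. Each group lists indexes into the input vector.
std::map<PaymentKey, std::vector<std::size_t>>
findDuplicatePayments(const std::vector<Payment>& payments) {
    std::map<PaymentKey, std::vector<std::size_t>> groups;
    for (std::size_t i = 0; i < payments.size(); ++i) {
        const Payment& p = payments[i];
        groups[PaymentKey(p.vendor, p.invoice, p.amountCents)].push_back(i);
    }
    // Drop keys that occur only once.
    for (auto it = groups.begin(); it != groups.end();) {
        if (it->second.size() < 2) {
            it = groups.erase(it);
        } else {
            ++it;
        }
    }
    return groups;
}
```

An exact-match key is the simplest case. Looser tests, such as the same amount under a different invoice number, would need a different key.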

Auditors Guide to SQLite

Documentation showing how to use the open source SQLite database to perform common audit functions.  View.

Screen Shots for EZ-R Stats for Windows

Documentation showing example screen shots for all of the menu selections for EZ-R Stats for Windows.

IP Address to Country

Source code and procedures for converting an IP address in dot notation to the corresponding country code (two characters). Useful in identifying and classifying Web traffic for SEO (and other) purposes. Code is open source, public domain (no cost). This information is available for download as a “setup” file or as a zip file.
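
The general idea can be sketched in two steps: parse the dotted address into a 32-bit number, then search a sorted table of address ranges. The code below is a sketch under assumptions and is not the downloadable code itself. The names parseIpv4, IpRange and lookupCountry are hypothetical, and the range table has to be supplied by the caller from whatever allocation data is available.

```cpp
#include <algorithm>
#include <cstdint>
#include <sstream>
#include <string>
#include <vector>

// Parse "a.b.c.d" into a 32-bit integer. Returns false on malformed input.
// Stream extraction is lenient about leading whitespace in this sketch.
bool parseIpv4(const std::string& text, std::uint32_t& out) {
    std::istringstream in(text);
    std::uint32_t value = 0;
    for (int i = 0; i < 4; ++i) {
        int octet = -1;
        char dot = 0;
        if (!(in >> octet) || octet < 0 || octet > 255) return false;
        if (i < 3 && (!(in >> dot) || dot != '.')) return false;
        value = (value << 8) | static_cast<std::uint32_t>(octet);
    }
    char extra;
    if (in >> extra) return false;  // reject trailing characters
    out = value;
    return true;
}

// One row of a hypothetical range table: inclusive bounds and a
// two-character country code.
struct IpRange {
    std::uint32_t first;
    std::uint32_t last;
    std::string country;
};

// Binary search for the range containing ip. The table must be sorted by
// 'first' and must not overlap. Returns an empty string when nothing matches.
std::string lookupCountry(const std::vector<IpRange>& ranges, std::uint32_t ip) {
    auto it = std::upper_bound(
        ranges.begin(), ranges.end(), ip,
        [](std::uint32_t value, const IpRange& r) { return value < r.first; });
    if (it == ranges.begin()) return std::string();
    --it;  // last range whose start is not above ip
    return (ip <= it->last) ? it->country : std::string();
}
```

Storing each address as a single integer keeps the table compact and makes each lookup logarithmic in the number of ranges.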

Analysis of Web Logs (Setup)

This is code I have contributed to the public domain, written in standard C++. The code is available in two formats: a “setup” file and a zip file. Note that the software produces a set of six output files, including pge (ranked page visits), ref (list of referrers), ip (unique IP addresses visiting), ep (unique IP addresses downloading software), and isp (list of major ISPs used).

Analysis of Web Logs

By carefully monitoring web logs, it is possible to optimize your web site in order to obtain a better ranking with the various search engines. I use the WebLog program to monitor traffic to my site, and I have experienced a compound rate of growth in excess of 38% monthly (as measured by both data transfer and total HTTP requests). The links below are to the host-provided statistics, charts of growth, and the code itself, which is written in standard C++. In order to use the code, you will need to tailor it for your own purposes. The program can identify unique visitors, classify visits by hour of day, gather statistics for downloads of exe files, report on visits to web pages ranked by popularity, and quantify search engine referrals from the major engines (Google, Yahoo, MSN). Program code and documentation with screen shots are available as a “setup” file for downloading and installation.
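
To give a feel for this kind of tally, here is a small sketch that counts unique visitors, hits by hour, and page popularity. It is not the WebLog program. It assumes the server writes the Common Log Format (address first, timestamp in square brackets, request in double quotes), and the names LogStats, addLogLine and rankPages are hypothetical.

```cpp
#include <algorithm>
#include <array>
#include <map>
#include <set>
#include <string>
#include <utility>
#include <vector>

struct LogStats {
    std::set<std::string> uniqueIps;      // distinct visitor addresses
    std::array<int, 24> hitsByHour{};     // requests per hour of day
    std::map<std::string, int> pageHits;  // requests per page
};

// Add one log line to the tallies. Returns false when the expected fields
// are not found. Assumed layout:
//   192.0.2.1 - - [21/Nov/2006:13:45:02 -0500] "GET /index.html HTTP/1.1" 200 1234
bool addLogLine(const std::string& line, LogStats& stats) {
    const std::size_t npos = std::string::npos;
    std::size_t ipEnd = line.find(' ');
    std::size_t bracket = line.find('[');
    std::size_t quote = line.find('"');
    if (ipEnd == npos || bracket == npos || quote == npos) return false;

    // The two-digit hour follows the first ':' inside the timestamp.
    std::size_t colon = line.find(':', bracket);
    if (colon == npos || colon + 2 >= line.size()) return false;
    char h1 = line[colon + 1];
    char h2 = line[colon + 2];
    if (h1 < '0' || h1 > '9' || h2 < '0' || h2 > '9') return false;
    int hour = (h1 - '0') * 10 + (h2 - '0');
    if (hour > 23) return false;

    // The requested path sits between the first two spaces of the request.
    std::size_t pathStart = line.find(' ', quote);
    if (pathStart == npos) return false;
    std::size_t pathEnd = line.find(' ', pathStart + 1);
    if (pathEnd == npos) return false;

    stats.uniqueIps.insert(line.substr(0, ipEnd));
    ++stats.hitsByHour[hour];
    ++stats.pageHits[line.substr(pathStart + 1, pathEnd - pathStart - 1)];
    return true;
}

// Pages ordered from most to least requested; ties stay in alphabetical order.
std::vector<std::pair<std::string, int>> rankPages(const LogStats& stats) {
    std::vector<std::pair<std::string, int>> ranked(stats.pageHits.begin(),
                                                    stats.pageHits.end());
    std::stable_sort(ranked.begin(), ranked.end(),
                     [](const std::pair<std::string, int>& a,
                        const std::pair<std::string, int>& b) {
                         return a.second > b.second;
                     });
    return ranked;
}
```

Feeding each line of a log file to addLogLine and then calling rankPages would give a popularity list comparable in spirit to a ranked page report. Referrer and search engine counts would need the extended log fields, which this sketch does not read.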
