Open Source Audit Software. IIA District Conference, Durham, NC, 2/27/2009. Track 1 – Internal Audit. Mike Blakley, EZ-R Stats, LLC.

Presentation transcript:

1 Open Source Audit Software IIA District Conference Durham, NC 2/27/2009 Track 1 – Internal Audit Mike Blakley, EZ-R Stats, LLC

2 Objectives 1. Open source audit software – advantages / disadvantages 2. Audit software functionality of four major software packages 3. SQLite - application in various audit areas

3 Objectives (cont’d) 4. RAT-STATS - random sampling 5. "R" system and its applications 6. Cephes - basic functionality 7. Excel and open source software

4 What is open source software? Source and binaries Languages Maintained by various persons Support / development - volunteer basis Licensing - GPL, Public Domain, etc.

5 Advantages 1. Transparency 2. Portability 3. Lower cost

6 Disadvantages 1. May require additional expertise 2. No slick front-end 3. Plain packaging 4. Support?

7 Objectives 1. Open source software - advantages / disadvantages Next topic: Four Major Packages

8 Four major packages 1. SQLite - database system 2. RAT-STATS - random sampling system 3. R - library of statistical and plotting routines 4. Cephes - mathematical and statistical routines

9 How Excel fits in Audit tests on data in SQLite RAT-STATS works with Excel workbooks R has an Excel interface Run R scripts from Excel Cephes routines can be called directly from Excel

10 Recap of objectives 1. Open source software - advantages / disadvantages 2. Four major software packages Next topic is SQLite

11 SQLite overview Developed in North Carolina! Largest number of database installations Public domain Standards compliant - SQL92 Very fast, written in “C” Zero installation

12 Example Audit uses Sample planning Population statistics Identification of duplicates Match/merge Benford's Law Same, same, different Data stratification
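
As an illustration of the Benford's Law use listed above, the following is a minimal sketch in Python using the standard `sqlite3` module. The table name `invoices`, the column `amount`, and the sample values are hypothetical and not from the presentation; the sketch only compares observed first-digit shares with the expected share log10(1 + 1/d).

```python
import math
import sqlite3
from collections import Counter


def benford_expected(digit):
    """Expected share of values whose first significant digit is `digit`."""
    return math.log10(1 + 1 / digit)


def first_digit_counts(conn, table, column):
    """Count leading digits of the non-zero values in table.column."""
    counts = Counter()
    query = f'SELECT "{column}" FROM "{table}" WHERE "{column}" <> 0'
    for (value,) in conn.execute(query):
        # Scientific notation always starts with the first significant digit.
        counts[int(f"{abs(value):.15e}"[0])] += 1
    return counts


# Hypothetical in-memory data standing in for an audit population.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE invoices (amount REAL)")
amounts = [120.50, 1890.00, 23.10, 310.00, 1.75, 0.19, 45.00, 150.25]
conn.executemany("INSERT INTO invoices VALUES (?)", [(a,) for a in amounts])

observed = first_digit_counts(conn, "invoices", "amount")
total = sum(observed.values())
for digit in range(1, 10):
    # digit, observed share, expected Benford share
    print(digit, observed[digit] / total, round(benford_expected(digit), 3))
```

A real test would need a far larger population than eight rows, and a formal comparison (for example a chi-square test) is left out of this sketch.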

13 Advantages Cost effective - fast database No license cost Simple to install Portable Standards compliant

14 Disadvantages Doesn't have every "bell and whistle" Doesn't support every functionality Basic system is “command line”

15 SQLite Front Ends Excel SQLite browser Others

16 Specific audit applications White paper available which explains many of the topics Article in EDPACS, June 2008

17 How to load data Load using manual "scripts" Load with free software Import from Excel, Access, text files
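
One way to script the text-file import mentioned above is sketched below in Python, using only the standard `csv` and `sqlite3` modules. The helper `load_csv`, the table name `payments`, and the sample rows are assumptions made for illustration; this is not the loader the presentation refers to.

```python
import csv
import io
import sqlite3


def load_csv(conn, table, csv_file):
    """Create `table` from the CSV header row and insert the remaining rows."""
    reader = csv.reader(csv_file)
    header = next(reader)
    columns = ", ".join(f'"{name}"' for name in header)
    placeholders = ", ".join("?" for _ in header)
    conn.execute(f'CREATE TABLE "{table}" ({columns})')
    conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', reader)
    conn.commit()
    return conn.execute(f'SELECT COUNT(*) FROM "{table}"').fetchone()[0]


# Hypothetical file contents; a real script would pass open("payments.csv", newline="").
sample = io.StringIO(
    "invoice_no,vendor,amount\n"
    "1001,Acme,120.50\n"
    "1002,Baker,75.00\n"
    "1003,Acme,120.50\n"
)
conn = sqlite3.connect(":memory:")
row_count = load_csv(conn, "payments", sample)
print(row_count, "rows loaded")
```

Because the columns are created without declared types, every value is stored as text; a query that needs numbers can use `CAST(amount AS REAL)`.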

18 Target audience Auditors Audit Managers Business Analysts Researchers Anyone working with large data volumes

19 Screen Shots of SQLiteBrowser 1. Identification of duplicates 2. “Drill down” (using where clause) 3. Population subtotals and basic statistics Public domain SQLite Database Browser

20 Identification of Duplicates
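
The screenshot for this slide is not part of the transcript. A common way to express a duplicate test in SQL is `GROUP BY` with `HAVING COUNT(*) > 1`; the sketch below runs it through Python's `sqlite3` module. The `payments` table and its rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE payments (invoice_no TEXT, vendor TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO payments VALUES (?, ?, ?)",
    [
        ("A-100", "Acme", 500.0),
        ("A-100", "Acme", 500.0),   # paid twice
        ("A-101", "Acme", 250.0),
        ("B-200", "Baker", 75.0),
        ("B-200", "Baker", 75.0),   # paid three times
        ("B-200", "Baker", 75.0),
    ],
)

# Any combination of vendor, invoice number and amount that occurs more
# than once is reported together with its number of occurrences.
duplicates = conn.execute(
    """
    SELECT vendor, invoice_no, amount, COUNT(*) AS occurrences
    FROM payments
    GROUP BY vendor, invoice_no, amount
    HAVING COUNT(*) > 1
    ORDER BY vendor, invoice_no
    """
).fetchall()

for row in duplicates:
    print(row)
```

Which columns define a "duplicate" is an audit judgment; dropping `amount` from the `GROUP BY` list, for instance, would also flag the same invoice number paid at different amounts.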

21 “Drill down” with where clause
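
A "drill down" of this kind can be sketched as a parameterized `WHERE` clause. The example below assumes a hypothetical `payments` table and a hypothetical helper `drill_down`; it is an illustration, not the query shown in the original screenshot.

```python
import sqlite3


def drill_down(conn, vendor, min_amount):
    """Return one vendor's payments at or above a threshold, largest first."""
    return conn.execute(
        "SELECT invoice_no, amount FROM payments "
        "WHERE vendor = ? AND amount >= ? "
        "ORDER BY amount DESC",
        (vendor, min_amount),
    ).fetchall()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE payments (invoice_no TEXT, vendor TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO payments VALUES (?, ?, ?)",
    [
        ("A-100", "Acme", 500.0),
        ("A-101", "Acme", 250.0),
        ("A-102", "Acme", 40.0),
        ("B-200", "Baker", 75.0),
    ],
)

acme_large = drill_down(conn, "Acme", 100)
print(acme_large)
```

Passing the values as `?` parameters rather than pasting them into the SQL string keeps the query safe when the filter values come from user input.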

22 Population Statistics
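
Population subtotals and basic statistics map naturally onto SQL aggregate functions. The sketch below, again with a hypothetical `payments` table, computes count, total, average, minimum and maximum per vendor.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE payments (invoice_no TEXT, vendor TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO payments VALUES (?, ?, ?)",
    [
        ("A-100", "Acme", 500.0),
        ("A-101", "Acme", 250.0),
        ("A-102", "Acme", 40.0),
        ("B-200", "Baker", 75.0),
    ],
)

# One summary row per vendor: count, total, average, smallest, largest.
stats = conn.execute(
    """
    SELECT vendor, COUNT(*), SUM(amount), AVG(amount), MIN(amount), MAX(amount)
    FROM payments
    GROUP BY vendor
    ORDER BY vendor
    """
).fetchall()

for vendor, count, total, average, smallest, largest in stats:
    print(vendor, count, total, round(average, 2), smallest, largest)
```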

23 More information SQLite site – EZ-R Stats – SQLite browser

24 Wrap up Objective 3 What is SQLite? In what audit areas can it be used? Data import Next topic is Random Sampling

25 RAT-STATS Federal HHS in San Francisco, with assistance from several universities Comprehensive Widely used in the health care industry Has withstood court challenges There are others, such as EZ-Quant (DOD)

26 Major functional areas 1. Random number generation 2. Sample size determination 3. Attribute sampling 4. Variable sampling 5. Types of sampling: stratified, unrestricted, other

27 How it works Windows based (no Mac or Linux) Simple to install Some documentation Works with Excel, Access and text files

28 Advantages Comprehensive Withstood court challenges as to validity Does all the computations Provides basic documentation for workpapers Easy to install No license cost

29 Disadvantages Only certain confidence levels Little transparency (FOIA) Support?

30 Screen Shots 1. Random numbers 2. Variable sampling

31 Random numbers
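
The idea behind this step, drawing a documented and repeatable set of random item numbers from a numbered population, can be sketched with Python's standard `random` module. This does not reproduce RAT-STATS or its generator; the function `draw_sample`, the population size and the seed are illustrative assumptions.

```python
import random


def draw_sample(population_size, sample_size, seed):
    """Select distinct item numbers from 1..population_size, reproducibly."""
    rng = random.Random(seed)  # a recorded seed lets the selection be re-run
    return sorted(rng.sample(range(1, population_size + 1), sample_size))


# Hypothetical frame of 5,000 items and a sample of 25.
selected = draw_sample(5000, 25, seed=20090227)
print(selected)
```

Recording the seed, the population size and the sample size in the workpapers allows a reviewer to regenerate exactly the same selection.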

32 Variable sampling
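
For readers unfamiliar with variable sampling, the sketch below shows a textbook mean-per-unit estimate of a population total for an unrestricted (simple random) sample, with a finite population correction. It is not the RAT-STATS computation: the function name and data are hypothetical, and a normal z-value is used as an approximation where a t-value would normally be preferred for small samples.

```python
import math
import statistics


def estimate_total(sample, population_size, confidence=0.90):
    """Mean-per-unit point estimate and two-sided interval for a total."""
    n = len(sample)
    mean = statistics.mean(sample)
    variance = statistics.variance(sample)   # sample variance, n - 1 divisor
    fpc = 1 - n / population_size            # finite population correction
    se_total = population_size * math.sqrt(variance / n * fpc)
    # Normal approximation (assumption); small samples call for a t-value.
    z = statistics.NormalDist().inv_cdf(0.5 + confidence / 2)
    point = population_size * mean
    return point, point - z * se_total, point + z * se_total, se_total


# Hypothetical audited values for 5 items drawn from 1,000.
point, lower, upper, se_total = estimate_total([100, 120, 80, 110, 90], 1000)
print(point, round(lower), round(upper))
```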

33 Wrap up Objective 4 What is RAT-STATS? Audit Areas Random numbers Attribute sampling Variable sampling Next topic is R

34 World-wide development Statisticians College Professors Library of statistical routines Extensive plotting and charting capabilities R is “GNU S”

35 Major functional areas 1. Statistical computing 2. Graphics 3. Linear regression and modeling 4. Statistical tests 5. Time series analysis 6. Data Classification

36 How it works Windows, Mac or Linux Relatively simple to install Extensive documentation Works with Excel, Access, text files, and many databases (including SQLite)

37 Audit areas Excellent capabilities for regression Does step-wise regression (quite costly in other packages) Sample planning Population statistics Charting/plotting as part of audit planning

38 Advantages Comprehensive Good charting and plotting capabilities Extensive statistical functions Easy to install No license cost

39 Disadvantages User interface Fairly steep learning curve Support?

40 Screen Shots 1. Stepwise regression 2. Plot - confidence/precision intervals

41 Stepwise regression

42 Confidence Intervals

43 Wrap up Objective 5 What is R? What audit areas can it be used to address? Next topic is Cephes

44 Cephes Federal Department of Energy at Oak Ridge Laboratories Library of mathematical and statistical routines (400+) Adaptation of earlier versions in FORTRAN Translated into C and Visual Basic Highly reliable and extensively tested

45 Major functional areas 1. Statistical computing 2. Mathematical computations 3. Probability

46 How it works Windows only Relatively simple to install Extensive documentation Works as stand alone routines or can be called from Excel

47 Audit areas Sample calculations Random number generation Sample planning Population statistics

48 Advantages Reliable, extensive testing (IEEE) Extensive statistical functions Easy to install No license cost

49 Disadvantages Support ?

50 Example of probability functions Chi square distribution Complemented Chi square Inverse Chi square Normal distribution Inverse normal distribution Poisson distribution Inverse Poisson distribution Student's t distribution
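
To show what a few of the listed functions compute, here is a small sketch using only Python's standard library rather than Cephes itself. Cephes provides such functions as C routines (for example `ndtr` and `ndtri` for the normal distribution and its inverse); the Python function names below are illustrative, and the chi-square and Student's t functions are omitted because the standard library has no direct equivalent.

```python
import math
from statistics import NormalDist

STANDARD_NORMAL = NormalDist()  # mean 0, standard deviation 1


def normal_cdf(x):
    """Normal distribution: probability that a standard normal value is <= x."""
    return STANDARD_NORMAL.cdf(x)


def inverse_normal(p):
    """Inverse normal distribution: the x for which normal_cdf(x) equals p."""
    return STANDARD_NORMAL.inv_cdf(p)


def poisson_cdf(k, mean):
    """Poisson distribution: probability of k or fewer events."""
    return sum(
        math.exp(-mean) * mean**j / math.factorial(j) for j in range(k + 1)
    )


print(normal_cdf(0.0))          # half of the distribution lies below the mean
print(inverse_normal(0.975))    # two-sided 95% critical value
print(poisson_cdf(2, 2.0))      # chance of at most 2 events when 2 are expected
```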

51 Example of Arithmetic and Algebraic functions Square root Long integer square root Cube root Evaluate polynomial Round to nearest integer value Truncate upward to integer Truncate downward to integer Absolute value

52 Screen shots 1. Calculations with Excel VBA 2. Plot with confidence/precision intervals

53 Calculations with Excel VBA

54 Plot with confidence/precision intervals

55 Wrap up Objective 6 What is Cephes? Useful for evaluation of random samples, linear regression, etc. Next topic is Excel as a platform

56 Excel as an audit platform Extensive capabilities, generally underused Can be integrated with open source software ActiveX Data Objects (ADO) Visual Basic for Applications (VBA) Calling external routines COM Servers

57 ActiveX Data Objects AuditNet "End User Database Access Using Excel" Example is use of SQLite

58 Visual Basic for Applications Very extensive capabilities Entire applications written in VBA Powerful audit tool Example library

59 Calling external routines Can be used to build scripts Then executed by external applications Excel - Shell command Provides ability to perform a variety of tasks, such as charting and plotting using R and running database queries

60 COM Servers Makes routines directly accessible to Excel using "CreateObject" Cephes library is an example Many free COM servers available Simplifies Excel by "compartmentalizing" program logic

61 Advantages Already widely used Many "built-in" capabilities Macro language VBA widely understood

62 Disadvantages Learning curve Support?

63 Wrap up Objective 7 Excel as an audit platform Uses include: database queries, running R, complex stat calculations

64 Questions? Contact info: