Skip to content

File Analyzer Training Code4Lib 2014

Terry Brady edited this page Mar 3, 2014 · 24 revisions

Pre-conference Preparation Tasks

  • Install and build the File Analyzer (required): Installation instructions
  • Send Terry a quick note confirming that you were able to complete the installs. At the end of the pre-conference session, we will code a custom File Analyzer rule. In your email, indicate your level of experience/comfort programming in Java. This portion of the session will be tailored to the experience of the audience.
  • A Java IDE is recommended for last portion of the pre-conference. If you do not already have a Java IDE available, consider installing the Eclipse Standard Edition: https://www.eclipse.org/downloads/

Training Outline

  • File Analyzer Overview
  • Try it yourself
  • Demonstration of highly customized File Analyzer Rules
  • Your ideas for future customizations
  • Coding a File Analyzer rule

Overview Documentation

Demonstration of basic tasks

User documentation is available at the link listed above.

Try it yourself

Sample data files corresponding to these exercises will be provided at the start of the pre-conference session.

Exercises to try

  • Run "Count Files by Type" on the "01_Flash Drive Inventory" folder. ** Sort the results from highest count to lowest count. What file type occurs most frequently?
  • Run "Match by Name" on the "01_Flash Drive Inventory" folder.  **  Which file names have been duplicated? ** Remove your open tabs
  • Run "Match by Base Name"  ** on the PDF folder ** run it again on the Word Docs folder ** Which word document does not have a corresponding PDF?
  • Remove the tabs from all of your prior tests.
  • Run "Sort by Checksum" looking only at image files ** on the Checksum Tests folder.  ** run it again on the Checksum Tests2 folder.  **  Which files are not identical between the 2 folders? ** Remove the tab for your test on the Checksum Tests2 folder. ** Export the results from your first "Sort by Checksum" task as a tab-delimited file.  Export only the key and data fields. ** Import your checksum results using "Import Delimited File" ** Use the merge tool to compare your imported file to the results from your checksum test ** No differences should exist

Demonstration of Customized File Analyzer Rules

  • Counter compliant report validation
  • Output to Bursar processing
  • Invoice processing
  • Identify digital derivatives
  • ETD Processing

Discussion: Your ideas for future customizations

Coding a customization

Clone this wiki locally