Fast Bulk Inserts into SQLite

Background: Sometimes it’s necessary to get information into a database quickly. SQLite is a lightweight database engine that can be easily embedded in applications. This article covers the process of optimizing bulk inserts into an SQLite database. While it focuses on SQLite, some of the techniques shown here also apply to other databases. [...]

Posted on August 10, 2010 at 10:11 am by Joe
In: Uncategorized

Nightly Benchmarks: Tracking Results with Codespeed

Background: Codespeed is a project for tracking performance. I discovered it when the PyPy project started using it to track their own performance. Since then, development work has made its setup easier and added more display options. Anyway, two posts ago I talked about running nightly benchmarks with Hudson. Then in the previous post I [...]

Posted on July 19, 2010 at 10:16 am by Joe
In: Uncategorized

Nightly Benchmarks: Setting up Hudson

For some projects, finding out about performance regressions is important. I’m going to write a two-part series about setting up a nightly build machine and displaying the generated data. This part covers installing Hudson and getting the benchmarks running nightly. I decided to give Hudson a try because I had [...]

Posted on April 8, 2010 at 9:36 am by Joe
In: Uncategorized

Sort Optimization (Part 2) with JDK 6 vs JDK 7

In part 1, I went over my first foray into the world of sorting algorithms. Since then, I’ve had some other ideas on how to improve my quicksort implementation. One idea that I had while originally working on the sorting algorithm was to rework the partition function to take duplicate elements into account. I had [...]

Posted on December 23, 2009 at 11:00 am by Joe
In: Uncategorized

NLTK vs MontyLingua Part of Speech Taggers

This is a comparison of the part-of-speech taggers available in Python. As far as I know, these are the most prominent Python taggers. Let me know if you think another tagger should be added to the comparison. MontyLingua includes several natural language processing (NLP) tools. The ones that I used in this comparison [...]

Posted on March 28, 2009 at 10:23 pm by Joe · 5 Comments
In: Uncategorized