Zip Code Magic
A while back I got to thinking about how we deal with zip codes as programmers. As a backend developer with a preference for PostgreSQL, I was always a bit stunned that Postgres doesn't have an explicit datatype for zip codes. Everything gets a type in Postgres, from UUIDs to the geometric shapes used for geofencing. You may be wondering why I would make a stink about something so simple. The USA uses 5-digit numbers to refer to specific regions of the country by a unique name. Store it as an integer and call it a day, right? Well, you could, and it would work pretty well most of the time. Thing is, zip code information provides much more than we usually allow it to.
The first digit of a zip code specifies a region. This digit can be 0-9, and that includes 0. That's right: zip codes in some of the US territories and in the Northeastern United States start with a zero, so they end up less than 5 digits long if you store them as integers. You could always store them as integers and then pad the front with zeroes whenever you need to print them, but there are some advantages to storing the zip code as a string, depending on the application.
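To make the leading-zero problem concrete, here is a minimal sketch of the "store as integer, pad on output" approach. The helper name format_zip is made up for illustration and is not part of any package mentioned in this post.

```python
def format_zip(value: int) -> str:
    """Render an integer-stored zip code as a 5-character string.

    format_zip is a hypothetical helper, shown only to illustrate padding.
    """
    if not 0 <= value <= 99999:
        raise ValueError(f"not a 5-digit zip code: {value!r}")
    # The 05d format spec pads with zeroes on the left to a width of 5.
    return f"{value:05d}"


# Converting to an integer silently drops the leading zero.
stored = int("02134")
print(stored)              # 2134
print(format_zip(stored))  # 02134
print(format_zip(501))     # 00501
```

This works, but every piece of code that reads the column has to remember to pad, which is one reason a string column can be the simpler choice.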
Storing a zip code as a string of 5 digits lets us leverage the SQL LIKE operator. This comes in handy when looking for zip codes that match a given prefix. While the first digit corresponds to a region, the first three digits (the 3-digit zip code) refer to a "Sectional Center Facility". These facilities often cover zip codes from just one city, and cities are rarely split across facilities. It can happen, though, and it does: three different sectional center facilities serve New York City. The next two digits specify the zip code region itself, and the final 4 digits (if they are added) narrow things down to a specific delivery segment, such as a block or a single building.
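A prefix lookup with LIKE can be sketched as follows. This uses Python's built-in sqlite3 module as a stand-in so the example runs without a database server; the same LIKE pattern works in PostgreSQL. The table name places, its columns, and the helper zips_with_prefix are assumptions made for illustration, and the rows are just sample data.

```python
import sqlite3

# In-memory database with a hypothetical "places" table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE places (zip TEXT PRIMARY KEY, city TEXT)")
conn.executemany(
    "INSERT INTO places (zip, city) VALUES (?, ?)",
    [
        ("43201", "Columbus, OH"),
        ("43215", "Columbus, OH"),
        ("44102", "Cleveland, OH"),
        ("02134", "Boston, MA"),
    ],
)


def zips_with_prefix(conn: sqlite3.Connection, prefix: str) -> list:
    """Return all stored zip codes that start with the given digits."""
    if not (prefix.isascii() and prefix.isdigit() and len(prefix) <= 5):
        raise ValueError(f"prefix must be 1 to 5 digits: {prefix!r}")
    # The prefix is validated as digits only, so appending % is safe here.
    rows = conn.execute(
        "SELECT zip FROM places WHERE zip LIKE ? ORDER BY zip",
        (prefix + "%",),
    )
    return [row[0] for row in rows]


print(zips_with_prefix(conn, "432"))  # ['43201', '43215']
print(zips_with_prefix(conn, "0"))    # ['02134']
```

Note that the lookup for "0" only works because the zip codes are stored as text; with an integer column the leading zero would already be gone.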
Since zip code data is public, I dug up the latest mammoth CSV file from the government archives and put together a Python package for quickly finding zip code info based on full or partial zip codes. The information was last updated in 2012, so it is not perfect; I hope to find a newer one soon. The package I built is incredibly simple, but it is quick and powerful when used properly. Right now, by supplying a full or partial 5-digit zip code, you can fetch "zip code objects" which can be used to give information about population, wages, taxes, geospatial positioning and more. This package uses no API, so it is usable and fast on both the front end and the back end. I plan to add more functions to increase its usefulness.
    $ pip install zipcode
    $ python
    >>> import zipcode
    >>> cbus_zips = zipcode.islike('432') #=> list of zip code objects in '432' sectional facility.
    >>> zipcode.isequal('44102').location_text #=> 'Cleveland, OH'
Update 9/14/2015: I've added some new functionality. The docs are up on http://pythonhosted.org/zipcode/.
(c) 2018 email@example.com