The United States reported their Retail trade statistics the other day (July 16 2012). And quite rightly this picked up a bit of press where it was reported that they fell for the third month in a row. As is usually the case, these falls refer to changes in the seasonally adjusted estimates. And three falls in a row is starting to look like something bad for all those retailers (and perhaps the wider economy).But given that the seasonally adjusted estimates still, by definition, contain a degree of volatility and also an underlying trend, we can go one better and derive our own smoothed estimate of the seasonally adjusted estimates. This will help us cut through the volatility and check out the underlying direction of the data.We derived a trend estimate in the following way.
- Downloaded the data from here: http://www.census.gov/retail/marts/www/timeseries.html
- Plugged them into R (statistical package)
- Applied a 13 term Henderson filter to the full seasonally adjusted data to generate a trend estimate
- Used the ggplot2 package in R, which produces very nice plots (but can take some effort to get the data into the right format, e.g. a dataframe with all the right bits)
- And we get the following picture with a trend line…
|Nov 2011||Dec 2011||Jan 2012||Feb 2012||Mar 2012||Apr 2012||May 2012||Jun 2012|
Other approaches could of course be used with different filters being applied and these would give slightly different results depending on the type and length of the filter used. It would have also been more useful if there was an official estimate of the trend as it would’ve saved some time as this is can be produced as a by-product of the seasonal adjustment process. One good thing about the data that can be downloaded from the census site is the availability of the seasonal factors, and also the sampling variability of the estimates. This is something you don’t often see being produced. So this is a big plus to have.Also note that there are a few different estimates floating around in the dataset, particularly: advance estimates, preliminary estimates, revised estimates, and then suppressed and also not available. So this can potentially be a bit confusing as each of these estimates will have different characteristics. This is something to keep in mind if you’re grabbing the latest information from any data source is that it can often be revised as new data becomes available. Actually – this is really a good thing as it means that we at least have the best, latest and most up-to-date information.