A lot of statistics on Indian economy and society are available online. These are scattered across various websites and are mostly provided without any information on the sources, data definitions and limitations of these statistics.
This website was conceived as a platform to host statistics on different aspects of the economy and society. The [...]
I recently had to prepare a beamer presentation in hindi/devanagari. I usually use emacs-orgmode with a lot of R source code embedded in it to prepare my beamer presentations. To adapt the entire setup to work with devanagari, this is what I needed to do.
Make orgmode export to latex using xetex rather than [...]
United Nations Economic Commission for Europe has published these two documents very interesting documents.
Making Data Meaningful, Part I
Making Data Meaningful, Part II
I am organising a group at CESP, JNU that will produce scripts using open source tools for reading NSSO and ASI data. The idea is to release these scripts under an open source license. Any researcher who purchases NSSO/ASI data will be able to use these scripts to process them.
We will use these scripts [...]
Vacancy announcement is here.
PROOFINESS: The Dark Arts of Mathematical Deception
An interesting book review in NYTimes. Hope to get hold of the book sometime.
I discovered a new, very useful, R function yesterday: ave.
This is what it does: “Subsets of ‘x‘ are averaged, where each subset consist of those observations with the same factor levels.”
But interestingly, you can use any function other than average. The output of that function is set against each observation.
I wanted [...]
Two students of social sciences who had so far used computers for limited, specific tasks recently asked me for some kind of a course outline that they should cover to pick up basic skills for using a spreadsheet software. I prepared the list and thought it may interest some others. Hence this post.
If a model is estimated using the following code: lm(y~x1+x2)->p
1. bptest(p) does the Breuch Pagan test to formally check presence of heteroscedasticity. To use bptest, you will have to call lmtest library.
2. If the test is positive (low p value), you should see if any transformation of the dependent variable helps you eliminate [...]
Making Data Meaningful: A guide to writing stories about numbers
The blurb: The guide is intended as a practical tool to help managers, statisticians and media relations officers use text, tables, graphics and other information to bring statistics to life.
Get the pdf version directly from here.
sapply(a,class) gives type of field (character, numeric, or factor) for each variable in the data frame a.
?rollmean (package=zoo)?rollmedian (package=zoo)?runmed (package=stats)
: The levels of a factor are re-ordered so that the level specified by ‘ref’ is first and the others are moved down. This is useful for ‘contr.treatment’ contrasts which take the first level as the reference.
In my previous post on this issue, I had presented a code that made weighted boxplots and annotated them with boxplot statistics and the mean values. The problem with that code was that it printed these annotations right on the vertical axes of the boxplots. Also, a relatively minor problem was that, when the values [...]
Here are some useful tips on the issue.