A python module to transform categorical variables to one hot encoded vectors. It particularly handles categorical variables of a dataset that cannot be fit into memory.
You can find a small tutorial in this blog post.
Any issue/bug reports or feature requests are greatly appreciated.