Git 2.9 brings new features that make reviewing changes easier. Everyone should set these configuration options to enable better Git diffs.
656x Faster JSON Parsing in Python with ijson
I identified a performance bottleneck in my team's Python code. It used the ijson package in a naive way. It's possible to achieve much faster JSON parsing.
Preserve bash history across multiple terminals and sessions
Is it possible to configure the terminal to preserve bash history? Yes, it's easy to configure bash so that it preserves history across sessions and tabs.
No meta description has been specified
How to set a meta description in WordPress and resolve the "No meta description has been specified, search engines will display copy from the page instead" warning.
Datathon For Diabetes in Boston
This weekend I'm at the Datathon for Diabetes in Boston. The goal is to use publicly available data to generate an insightful analysis of diabetes.
Ottawa: Data Day 3.0 at Carleton
On Tuesday, March 29, I'll be demoing Data Scientist Workbench (DSWB) at Data Day 3.0 in Ottawa for the Carleton University Institute of Data Science.
Spark Summit East 2016
Next week I'll be demoing Data Scientist Workbench at Spark Summit East in New York. Polong Lin will be there with me. Come by the expo floor next Wednesday and Thursday and chat with us.
Datapalooza Seattle on Feb 9-11
From February 9 through 11, I'll be mentoring hackers and budding data scientists at Galvanize during Datapalooza Seattle. It should be a great conference, covering topics like machine learning, natural language processing, and data engineering infrastructure.
Intro to #datascience and #spark at #ibminsight 2015
I'll be teaching two hands-on labs at Insight 2015 in Las Vegas. LCD-3459, Introduction to Data Science: Data science is a very popular job profile and in great demand in a wide variety of industries. You no longer need a Ph.D. in mathematics or statistics to become a data scientist. Any data professional can upgrade …
Setting up a new MacBook
I've just migrated to a new MacBook Pro as my primary work machine. As part of setting it up, I installed the following: Caffeine, to prevent it from going to sleep when I don't want it to; BetterTouchTool, so that I can middle-click (three-finger tap) to close tabs and paste in …