About dmartin

This author has not yet filled in any details.
So far dmartin has created 17 blog entries.

Awesome AWS Now on GitHub!

A curated list of awesome Amazon Web Services (AWS) libraries, open source repos, guides, blogs, and other resources [...]

By |October 25th, 2015|Categories: Cloud, Data, GitHub|Comments Off on Awesome AWS Now on GitHub!

SAWS, A Supercharged AWS CLI, Now on GitHub!

Interactive command line interface that aims to supercharge the AWS CLI with features focusing on improving ease-of-use and increasing productivity. Under the hood, SAWS is powered by the AWS CLI and supports the same commands and command structure [...]

By |September 20th, 2015|Categories: Cloud, Data, GitHub|Comments Off on SAWS, A Supercharged AWS CLI, Now on GitHub!

Dev Setup Now on GitHub!

Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapReduce, AWS, Heroku, JavaScript web development, Android development, common data stores, and dev-based defaults for Mac OSX [...]

By |August 9th, 2015|Categories: Cloud, Data, GitHub, Mobile|Comments Off on Dev Setup Now on GitHub!

Interactive Coding Challenges Now on GitHub!

Interactive, test-driven Python coding challenges (algorithms and data structures). Challenges focus on algorithms and data structures that are typically found in coding interviews or coding competitions [...]

By |July 11th, 2015|Categories: GitHub|Comments Off on Interactive Coding Challenges Now on GitHub!

Data Science Python Notebooks Now on GitHub!

Continually updated Data Science Python Notebooks: Spark, Hadoop MapReduce, HDFS, AWS, Kaggle, scikit-learn, matplotlib, pandas, NumPy, SciPy, and various command lines […]

By |June 6th, 2015|Categories: Cloud, Data, GitHub|Comments Off on Data Science Python Notebooks Now on GitHub!

Python Hadoop MapReduce: Analyzing AWS S3 Bucket Logs with mrjob

mrjob lets you write MapReduce jobs in Python 2.5+ and run them on several platforms. You can: Write multi-step MapReduce jobs in pure Python, test on your local machine, run on a Hadoop cluster, run in the cloud using Amazon Elastic MapReduce (EMR) [...]

By |May 17th, 2015|Categories: Cloud, Data|Comments Off on Python Hadoop MapReduce: Analyzing AWS S3 Bucket Logs with mrjob

Tableau 9 Features: Impressions from Beta

With the final beta in the hands of testers, I thought I would give a quick overview of my favorite features in Tableau 9 [...]

By |March 24th, 2015|Categories: Data|Comments Off on Tableau 9 Features: Impressions from Beta

Setting Up Splunk for AWS

I recently hooked up Splunk with AWS to search, monitor, and analyze log files. Splunk indexes data on read and allows for super-fast searching and visualization. I like to think of Splunk as Google Search for log files with visualization built-in [...]

By |February 1st, 2015|Categories: Cloud, Data|Comments Off on Setting Up Splunk for AWS

A Brief Introduction to R Unit Testing with test_that

Testing is a vital part of software development. I've always been a fan of Test Driven Development (TDD) and use nose for my python data analysis projects. I've recently hooked up test_that to my R Snippets Repo [...]

By |January 18th, 2015|Categories: Data|Comments Off on A Brief Introduction to R Unit Testing with test_that

Website Redesign and Jekyll Mirror

I had some free time over the Christmas to New Year's break and completed overhauling my personal website donnemartin.com. I've also started to build up its mirror site donnemartin.github.io powered by Jekyll. I love that I can use my existing developer tools to generate content (SublimeText, Terminal, GitHub). [...]

By |January 1st, 2015|Categories: Cloud, Data, Mobile|Comments Off on Website Redesign and Jekyll Mirror