Archive for the ‘Uncategorized’ Category

AI World Conference Moderator

Posted: October 6, 2016 in Uncategorized

Aai_world1s Managing Editor for insideBIGDATA.com, I’ve been invited to moderate a panel for AI World Conference & Expo in San Francisco, November 7-9, 2016. The topic of the panel is “High Performance Computing for the AI Ecosystem,” and includes several industry luminaries from high-profile companies like NVIDIA, Baidu Research, and Facebook. I really enjoy attending shows like this one, at the bleeding-edge of technology innovation. I always learn a lot and it serves to help direct my own research and consulting work.

ai_world2

AI World is the nation’s largest AI business conference with more than 90 speakers (including 75+ CxO’s) and 40+ sponsors and exhibitors to help attendees learn how to successfully harness intelligent technologies to build competitive advantage, drive new business opportunities and accelerate innovation efforts.

Advertisements

lafashion_paperOver the past year, I’ve been working on a new research project that culminated with a whitepaper in support of the Los Angeles fashion industry – “Los Angeles Fashion Industry: A Data Science Perspective.” The work was done in collaboration with LA’s premiere fashion mart, The New Mart. The paper is in response to numerous stories found in the local apparel industry trade press that held the sentiment that LA fashion had seen better days and was on a downward slide. We decided to use privately procured and publicly available data sets to find a single truth surrounding this matter. The whitepaper’s position is quite the opposite – LA fashion is a Los Angeles mainstay and continues on an upward trajectory. See for yourself by downloading the paper HERE.

 

DemoDay2_DanI was pleased to present last evening at the Grid110 Demo Day hosted by The New Mart, LA’s premiere fashion mart in DTLA. Grid110 is a new start-up business accelerator in partnership with the Office of the Los Angeles Mayor. My topic was putting a new face on the LA apparel industry using data science methodologies. My analysis used data sets from the Los Angeles Open Data repository. I’m a big proponent of government open data resources with the goal of improving the lives of citizens in ever more data-driven cities.

My presentation is provide below. Check out the data visualizations especially the geospatial data analysis clusters showing business starts for the past 10 years across the various industry codes that constitute the LA apparel industry. Moving forward, I will be collaborating with Grid110 in 2016 to publish a new Fashion Tech industry report, develop a new Shiny app, and collect data points for a new Fashion Tech sector database. Exciting stuff!

 

insideBIGDATA Guide to Retail

Posted: September 9, 2015 in Projects, Uncategorized

insideBIGDATA_Guide_RetailI’d like to announce the availability of a new technology guide that I was contracted to research, develop and write — “insideBIGDATA Guide to Retail” sponsored by Dell and Intel. This guide is directed toward line of business leaders in conjunction with enterprise technologists with a focus on the above opportunities for retailers and how Dell can help them get started. The guide also will serve as a resource for retailers that are farther along the big data path and have more advanced technology requirements.

I was excited about writing this guide since I spend a lot of my time as a practicing data scientist in the fashion industry where I build machine learning solutions to enhance brand awareness.

You can download a copy of the guide HERE.

MachineLearning_book_cover_smallI’m very proud (and relieved) to announce that my year-long+ book project is finally done! “Machine Learning and Data Science: An Introduction to Statistical Learning Methods with R” is available from Technics Publications. The book provides an introduction to the entire data science process, highlighting the ways that machine learning can be used to solve business problems. Both supervised and unsupervised statistical learning techniques are included. The R statistical programming language is used throughout. Here is the table of contents:

Introduction

Chapter 1: Machine Learning Overview

Chapter 2: Data Access

Chapter 3: Data Munging

Chapter 4: Exploratory Data Analysis

Chapter 5: Regression

Chapter 6: Classification

Chapter 7: Evaluating Model Performance

Chapter 8: Unsupervised Learning

The book is perfect for newbies just entering the data science field who wish to quickly get up to speed with the technology. I plan to use the book for the introductory courses I teach for corporations and universities. You can pre-order the book on Amazon HERE. You can find all the R code used in the book at this GitHub repo.

 

Hadoop Summit 2015

Posted: May 18, 2015 in Events, Uncategorized

hadoop_summit_logoI am pleased to report that I will be attending the upcoming Hadoop Summit 2015 in San Jose on June 9-11. I’ll be the guest of Hortonworks (host of the show and one of the leading Hadoop distributions) and will be covering the conference for insideBIGDATA. Check out insideBIGDATA’s new Hadoop 101 learning channel where I shall publish many of the new presentations I find at the Hadoop Summit.

Toyota_Financial_ServicesI just completed a short Boot-Camp style corporate training gig for Toyota Financial Services. The 3 full-day session was organized by UC Irvine Extension as part of their Data Science program. I had a blast with the Toyota group consisting of Big Data and BI managers, analysts, IT personnel, and programmers. It was a very insightful group with a sincere desire to learn R, data science and machine learning. I came away quite impressed with Toyota (too bad they’re moving to Texas).