R tidyverse, dplyr and ggplot2 at Tampa Bay Data Science Group

This month at Tampa Bay Data Science Group, Dr. Thomas Keller presented an introduction to the R tidyverse: dplyr, ggplot2, and some Twitter conference data visualization. Two other good dplyr/ggplot resources are this blog post (https://rollingyours.wordpress.com/2016/07/19/dplyr-and-zika-epilogue/) and this recent JSM talk by Jim Horton (https://github.com/Amherst-Statistics/JSM2016-thinkwithR/blob/master/jsm2016-horton.pdf). Here are links to the slides (http://thomas-keller.github.io/talks/intro_ggplot_twitter_conf_20160808.pdf), R…

Apache Spark on Azure HDInsight - My Talk @ Tampa Bay Data Science Group

Last week Microsoft announced that Apache Spark on Azure HDInsight (Microsoft’s managed Hadoop and Spark cloud service) is now generally available. I spoke to the Tampa Bay Data Science Group last night about Apache Spark on Azure HDInsight and the associated offerings. Spark for Azure HDInsight offers customers an enterprise-ready Spark solution that’s fully managed, secured, and highly available…

A Workable Definition of Data Science

John Foreman, in Data Smart: Using Data Science to Transform Information into Insight, works towards a workable definition of data science, and it is an excellent read. To an extent, data science is synonymous with or related to terms like business analytics, operations research, business intelligence, competitive intelligence, data analysis and modeling, and knowledge extraction (also called knowledge…

Data Science Hackathon

Online Data Science Hackathon - Apache Spark Maker Build. This is a global online hackathon that begins May 23, 2016 and ends August 3, 2016. Submissions are due August 3rd by 5:00 PM EST. Build an Apache Spark application to address a real business problem or core concern related to customer care, marketing, risk management or…

Exploring Spark with Data Science Workbench

Apache Spark is a general-purpose cluster computing platform that extends MapReduce to support multiple computation types, including but not limited to stream processing and interactive queries. Last week IBM's Moktar Kandil presented at the joint meetup of the Tampa Hadoop and Tampa Data Science groups on the topic of exploring Apache Spark. Apache Spark for Azure HDInsight. Following are…

State of Facial Recognition (Azure Face API et al) & Sentiment Analysis

Sam just wrote a précis, Why Facial Recognition Is the Next Big Thing in Marketing, which outlines how brands are using, or could use, facial recognition to increase engagement and, therefore, sales. From a machine learning and data science perspective, building algorithms which understand what one's face is really saying, i.e. performing emotion analysis to find insight into purchasing patterns…
