Month: July 2017

Differences Between Bookmarks, Tags & Lists

Bookmarks, Tags & Lists

  • All three of these Spotfire features are used to capture pieces of an analysis, but have you ever wondered when to use one versus the other?
  • Have you ever wondered about the subtle differences between them?

Read More

Guest Spotfire blogger residing in Whitefish, MT.  Working for SM Energy’s Advanced Analytics and Emerging Technology team!

CRISP DM: Deployment

Welcome to the next installment of our Analytics Journey, which explores how we at Ruths.ai apply the CRISP-DM method to our Data Science process. Previously, we looked at an overview of the methodology as a whole as well as the Business UnderstandingData UnderstandingData Preparation, Modeling, and Evaluation stages.  Next, we examine the final stage:  Deployment.

The.  Final.  Stage.  Now, we just have to turn this thing on and reap the rewards, right?

      

Unfortunately, Deployment does not just happen with the push of a George Jetson button.

Read More

Jason is a Junior Data Scientist at Ruths.ai with a Master’s degree in Predictive Analytics and Data Science from Northwestern University. He has experience with a multitude of machine learning techniques such as Random Forest, Neural Nets, and Hidden Markov Models. With a previous Master’s in Creative Writing, Jason is a fervent believer in the Oxford comma.

Using Support Vector Machines in Spotfire

(Image Source: opencv.org)

Support Vector Machines (SVMs) is one of the most popular and most widely used machine learning algorithms today. It is robust and allows us to tackle both classification and regression problems. In general, SVMs can be relatively easy to use, have good generalization performance, and often do not require much tuning. Follow this link for further information regarding support vector machines. To help illustrate the power of SVMs, we thought it would be useful to go through an example using a custom template we have created for SVMs.

Read More

Emanuel holds a Master of Science Degree in Data Science and Analytics from the University of Oklahoma and brings years of experience in production engineering.

Using the “spTimer” Package to Model Spatio-Temporal Data in R

The “spTimer” package uses three Bayesian models to fit Spatio-Temporal Data. The data may be given at sparse spatial stations, where observations at each station are considered time series. The package can model the residual spatio-temporal variation to measure uncertainty. It also gives flexibility to customize covariance function selection, the hyper-parameters of the prior distributions and the tuning parameters for the implemented MCMC algorithms.

Read More

Marking, Filtering, and Limiting, Oh My!

To veteran Spotfire users, the distinction between Marking, Filtering, and Limiting might seem obvious; however, to an uninitiated member, some similarities might cause confusion. In fact, one often can obtain the same exact result using combinations of Marking, Filtering, and Limiting. All the methods allow the user to make a click in one area that affects other visualizations. All the methods in their own way highlight a subset of the data.

Read More

Jason is a Junior Data Scientist at Ruths.ai with a Master’s degree in Predictive Analytics and Data Science from Northwestern University. He has experience with a multitude of machine learning techniques such as Random Forest, Neural Nets, and Hidden Markov Models. With a previous Master’s in Creative Writing, Jason is a fervent believer in the Oxford comma.

Missing Value Imputation with Data Augmentation in R

Incomplete data is a problem that Data Scientists face every day. Most common practices vary from complete deletion of the observations with missing values, substitution by a fixed value, or performing imputation using statistics like the mean or median. Since these approaches have limitations on capturing the structure of the data, scientists have developed more sophisticated methods.

Read More

CRISP-DM: Evaluation

Welcome to the next installment of our Analytics Journey, which explores how we at Ruths.ai apply the CRISP-DM method to our Data Science process. Previously, we looked at an overview of the methodology as a whole as well as the Business UnderstandingData Understanding, Data Preparation, and Modeling stages.  Next, we examine the Evaluation stage.

Read More

Jason is a Junior Data Scientist at Ruths.ai with a Master’s degree in Predictive Analytics and Data Science from Northwestern University. He has experience with a multitude of machine learning techniques such as Random Forest, Neural Nets, and Hidden Markov Models. With a previous Master’s in Creative Writing, Jason is a fervent believer in the Oxford comma.