Category: Spotfire

Spotfire Filters Don’t Recognize Special Characters

It’s FRIDAY!!!  I know you are all looking forward to the weekend, but I learned something new again this week and wanted to share it. This little tip is related to filers and special characters.

List Box Filters & Special Characters

I noticed a long time ago that the Spotfire list box filter (and presumably text filters) don’t respond to special characters, as shown below.  I didn’t really know what to do about it until recently.Search without special characters Search with special characters

Solution

It turns out, the solution is really simple.  Just ignore the special character.  Don’t type it, and the search function in the filter will work.

Special characters solutoin

 

Data Science Toolkit Improvements

This week, I was able to test out the latest and greatest changes to the Ruths.ai Data Science Toolkit.  New options and features allow users to easily split test and training data sets prior to model building, as all good data scientists should!  This new functionality speeds up your analysis by making model build and evaluation faster and more efficient.  I worked up this video to demonstrate.

 Data Science Toolkit for Spotfire

The Data Science Toolkit brings the power of advanced data science to Spotfire.  Ruths.ai designed it with simplicity and efficiency in mind to support a wide range of analytics applications. This extension is coupled with comprehensive training that provides both beginner and experienced users a strong foothold in data science analysis.  The Data Science Toolkit is available to Premium subscribers.  Once deployed on your Spotfire server, quickly and easily access the toolkit via the Tools menu as shown below.  Find out more, including videos, at this link.

Data Science Toolkit menu

Please feel free to reach out to me or anyone else on the Ruths.ai team to learn more about this amazing product.  We love to talk about it!

 

Linear Regression, the simplest Machine Learning Model

Linear Regression models are the simplest linear models available in statistical literature. While the assumptions of linearity and normality seem to restrict the practical use of this model, it is surprisingly successful at capturing basic relationships and predicting in most scenarios. The idea behind the model is to fit a line that mimics the relationship between target variables and a combination of predictors (called independent variables). Multiple regression refers to only one target variable and multiple predictors. These models are popular not only for solving the prediction task but also for working as a model selection tools allowing to find the most important predictors and eliminate redundant variables from the analysis.

Read More

Real Estate Secrets: Hidden Trend Visualization

Everyone who has ever owned or lived in a house knows at least a little bit about the whims of the real estate market. Big houses cost more, neighborhood matters, proximity to basic services is great, age and style are important in some markets, you name it. But what is it that matters the most? This is a question that visualization can help us answer.

Read More

Normal Distribution Curve on a Visualization

I received an interesting request from a user that deserves sharing.  The user requested a visualization showing the curve of a normal distribution of data points.  Now, just to be clear, a visualization that shows the distribution of data points is a histogram, which looks like this:

Histogram

histogram

The histogram might vary a little bit if you change the number of bins being used, but it always has the continuous value along the X-axis and the (Row Count) on the Y-Axis.  However, the user didn’t want to see the bars of the histogram, just a curve that represented the histogram, which would look like this:

Normal Distribution Curve

curve only

This type of visualization is simple and easy to create in Spotfire using the following steps.

Creating the Visualization

  1. Add a bar chart
  2. Configure the X-Axis with the continuous value and the Y-Axis with (Row Count)
  3. On the X-Axis, click the down arrow on the axis selector and make sure the “Auto-Bin” box is checked.
  4. If needed, right click on the axis selector and choose “Number of Bins” to set the desired number of bins.
  5. In the legend, click on the color circle and color the bars the same color as the background (probably white).
  6. Go to Properties > Lines & Curves > Add > Gaussian Curve fit

BAM!  Done!  The Gaussian Curve fit is the normal distribution and represents the histogram as a curve.  If you combine the curve and the histogram, it looks like this:

curve and histogram

 

In the end, Spotfire had the functionality to quickly and easily meet the user’s needs!