IBM SPSS Modeler - Data mining, text mining, predictive analysis

Solve your toughest challenges with data mining

IBM SPSS Modeler Professional and IBM SPSS Modeler Premium is available in both a desktop-based client deployment as well as a client/server deployment model. The features shown in the chart below are accessed from the client. IBM SPSS Modeler Server, available as both a Professional and Premium edition, provides server-based processing and performance enhancement as well as additional features such as batch processing, SQL pushback and in-database mining.

	Modeler Professional	Modeler Premium
Features
Create a wide range of interactive graphs with automatic assistance	X	X
Use visual link analysis to see the associations in your data	X	X
Interact with data by selecting regions or items on a graph and view the selected information; or select key data for use in analysis	X	X
Access IBM SPSS Statistics tools directly from interface	X	X
Read from and write to operational data from a variety of operational datasources such as IBM DB2®, Oracle®, Microsoft SQL Server™, Informix®, Neoview, Netezza, mySQL (Sun) and Teradata.	X	X
Read from and write to data views and metadata stored in Cognos® 8 Business Intelligence	X	X
Import from and export to delimited and fixed-width text files, any IBM SPSS Statistics file, SAS, IBM Data Collection data sources, or XML	X	X
Data-cleaning options that remove or replace invalid data, automatically impute missing values and mitigate for outliers and extremes	X	X
Apply automatic data preparation to interrogate and condition data for analysis in a single step	X	X
Field filtering, naming, derivation, binning, re-categorization, value replacement, and field reordering	X	X
Record selection, sampling (including clustered and stratified sampling), merging (including inner joins, full outer joins, partial outer joins, and anti-joins), and concatenation; sorting, aggregation, and balancing	X	X
Data restructuring, partitioning and transposition	X	X
Extensive string functions: string creation, substitution, search and matching, whitespace removal, and truncation	X	X
RFM scoring: aggregate customer transactions to provide Recency, Frequency, and Monetary value scores and combine these to produce a complete RFM analysis.	X	X
Use interactive model and equation browsers and view advanced statistical output	X	X
Show relative impact of different data attributes on predicted outcomes with variable importance graphs	X	X
Combine multiple models or use one model to analyze a second model	X	X
Use automatic classification (both binary and numeric) and clustering models to eliminate need for selecting individual algorithm	X	X
Use Modeler’s Component-Level Extension Framework (CLEF) to build custom algorithms	X	X
Through the integration of IBM SPSS Statistics, use R to extend analysis options through customization	X	X
C&RT, CHAID & QUEST—Decision tree algorithms including interactive tree building	X	X
Decision List—Interactive rule-building algorithm	X	X
K-Means, Kohonen, Two Step, Discriminant, Support Vector Machine (SVM) — Clustering and segmentation algorithms	X	X
GRI—Generalized rule induction association discovery algorithm	X	X
Factor/PCA, Feature Selection—Data reduction algorithms	X	X
Regression, Linear, GenLin (GLM)—Linear equation modeling	X	X
Self-learning response model (SLRM)—Bayesian model with incremental learning	X	X
Time-series—Generate and automatically select time-series forecasting models	X	X
C5.0 decision tree and rule set algorithm	X	X
Neural Networks—Multi-layer perceptrons with back-propagation learning, and radial basis function networks	X	X
Support Vector Machines—Advanced algorithm with accurate performance for wide datasets	X	X
Bayesian Networks—graphical probabilistic models	X	X
Cox regression—calculate likely time to an event	X	X
Anomaly Detection—Detect unusual records through the use of a cluster-based algorithm	X	X
KNN – Nearest neighbor modeling and scoring algorithm	X	X
Apriori—Popular association discovery algorithm with advanced evaluation functions	X	X
CARMA—Association algorithm which supports multiple consequents	X	X
Sequence—Sequential association algorithm for order-sensitive analyses	X	X
Export models using SQL or PMML (the XML-based standard format for predictive models)	X	X
Extract text data from files, operational databases and RSS feeds (ie. blogs, web feeds)		X
Select native language extractor options for Dutch, English, French, German, Italian, Portuguese, Spanish or Japanese or translate virtually any language using Language Weaver (separately licensed)		X
Extract domain-specific concepts such as uniterms, expressions, abbreviations, acronyms, and more		X
Calculate synonyms using sophisticated linguistic algorithms and embedded or user-specified linguistic resources		X
Name concepts by person, organization, term, product, location, and other user- defined types		X
Extract non-linguistic entities such as address, currency, time, phone number, and social security number (SSN)		X
Use and customize pre-built templates and libraries for sentiment analysis, CRM, security and intelligence, market intelligence, life sciences, and IT		X
Leverage pre-packaged Text Analytics Packages (TAPs) for the most common business applications and create your own		X
Create clusters based on term co-occurrence using concept clustering algorithms, which provide an at-a-glance view of main topics and the way in which they are related		X
Intelligently group text documents and records based on content, using text classification algorithms –Enable advanced concept selection and deselection for use in predictive modeling		X
Use text based and visual reports to interrogate concept relationship, occurrence, frequency and type		X
Identify and extract sentiments (for example, likes and dislikes) from text in Dutch, English, French, German, and Spanish		X
Ability to read from IBM Classic Federation data sources	X	X

Learn more about the Modeler family