Highly comparative time-series analysis with hctsa
  • Information about hctsa
    • Introduction
    • Getting started
    • Publications using hctsa
    • UMAP Projections
    • Related Time-Series Resources
    • List of included code files
    • FAQ
  • Installing and using hctsa
    • General advice and common pitfalls
    • Installing and setting up
      • Structure of the hctsa framework
      • Overview of an hctsa analysis
      • Compiling binaries
    • Running hctsa computations
      • Input files
      • Performing calculations
      • Inspecting errors
      • Working with hctsa files
    • Analyzing and visualizing results
      • Assigning group labels to data
      • Filtering and normalizing
      • Clustering rows and columns
      • Visualizing the data matrix
      • Plotting the time series
      • Low dimensional representation
      • Finding nearest neighbors
      • Investigating specific operations
      • Exploring classification accuracy
      • Finding informative features
      • Interpreting features
      • Comparing to existing features
      • Working with short time series
    • Working with a mySQL database
      • Setting up the mySQL database
      • The database structure
      • Populating the database with time series and operations
      • Adding time series
      • Retrieving from the database
      • Computing operations and writing back to the database
      • Cycling through computations using runscripts
      • Clearing or removing data
      • Retrieving data from the database
      • Error handling and maintenance
Powered by GitBook
On this page
  • Example 1: Compute a feature vector for a time series
  • Example 2: Analyze a time-series dataset

Was this helpful?

Export as PDF
  1. Installing and using hctsa
  2. Installing and setting up

Overview of an hctsa analysis

PreviousStructure of the hctsa frameworkNextCompiling binaries

Last updated 4 years ago

Was this helpful?

At its core, hctsa analysis involves computing a library of time-series analysis features (which we call operations) on a time-series dataset.

The basic sequence of a Matlab-based hctsa analysis is to: 1. Initialize a HCTSA.mat file, which contains all of the information about the set of time series and operations in your analysis, as well as the results of applying all operations to all time series, using ,

  1. These operations can be computed on your time-series data using . The results are structured in the local HCTSA.mat file containing matrices (that store the results of the computations) and the tables (that store information about the time-series data and operations), as described .

  2. After the computation is complete, are provided to understand and interpret the results.

Example 1: Compute a feature vector for a time series

As a quick check of your operation library, you can compute the full default code library on a time-series data vector (a column vector of real numbers) as follows:

x = randn(500,1); % A random time-series
featVector = TS_CalculateFeatureVector(x,false); % compute the default feature vector for x

Example 2: Analyze a time-series dataset

Suppose you have have a time-series dataset to analyze. You first generate a formatted INP_ts.mat input file containing your time series data and associated name and keyword labels, as described . You then initialize an hctsa calculation using the default library of features:

TS_Init('INP_ts.mat');

This generates a local file, HCTSA.mat containing the associated metadata for your time series, as well as information about the full time-series feature library (Operations) and the set of functions and code to call to evaluate them (MasterOperations), as described .

Next you want to evaluate the code on all of the time series in your dataset. For this you can simply run:

TS_Compute;

As described , or, for larger datasets, using a script to regularly save back to the local file (cf. sample_runscript_matlab).

Having run your calculations, you may then want to label your data using the keywords you provided in the case that you have labeled groups of time series:

TS_LabelGroups;

and then normalize and filter the data using the default sigmoidal transformation:

TS_Normalize;

A range of visualization scripts are then available to analyze the results, such as plotting the reordered data matrix:

TS_Cluster; % compute a reordering of data and features
TS_PlotDataMatrix; % plot the data matrix

To inspect a low-dimensional representation of the data:

TS_PlotLowDim;

Or to determine which features are best at classifying the labeled groups of time series in your dataset:

TS_TopFeatures;

Each of these functions can be run with a range of input settings.

TS_Init
TS_Compute
here
a range of processing, analysis, and plotting functions
here
here
here