1) Ripser applied to cyclo-octane molecule and optical image patches

Leader: Henry Adams, https://www.math.colostate.edu/~adams/

At this beginner’s table, we will use the Ripser software package to compute the persistent homology of point cloud datasets (including the cyclo-octane molecule and optical image patches); a beginner’s version of Ripser can be run in one’s browser (live.ripser.org) without downloading any software.

Software

- Ripser; use online at live.ripser.org or install from https://github.com/Ripser/ripser
- Code from the GitHub repository https://github.com/henryadams/Charleston-TDA-ML
- There are also many links to TDA software at https://www.math.colostate.edu/~adams/advising/ under “What applied topology software options do I have?”
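
For readers who want a feel for what a persistent homology computation does before trying Ripser, the following is a minimal Python sketch of the simplest case only: 0-dimensional persistence (connected components) of a Vietoris-Rips filtration, computed with a union-find structure. It is not Ripser's algorithm or API. The function name `h0_persistence` and the toy circle data are assumptions made for illustration, and the sketch assumes the convention that an edge enters the filtration at a scale equal to its length.

```python
import math
from itertools import combinations


def h0_persistence(points):
    """Return the death scales of H0 classes in a Vietoris-Rips filtration.

    Every point is born at scale 0. A connected component dies at the length
    of the edge that first merges it into another component (this is
    Kruskal's minimum spanning tree algorithm). The one component that never
    dies is omitted from the returned list.
    """
    n = len(points)
    parent = list(range(n))

    def find(i):
        # Follow parent links to the component representative,
        # shortening the path along the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # All pairwise edges, sorted by Euclidean length.
    edges = sorted(
        (math.dist(points[i], points[j]), i, j)
        for i, j in combinations(range(n), 2)
    )

    deaths = []
    for length, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[ri] = rj        # merge the two components
            deaths.append(length)  # one H0 class dies at this scale
    return deaths


# Hypothetical toy data: 12 evenly spaced points on the unit circle.
circle = [
    (math.cos(2 * math.pi * k / 12), math.sin(2 * math.pi * k / 12))
    for k in range(12)
]
deaths = h0_persistence(circle)
```

Here all 11 finite H0 classes die at the same scale, the distance between neighboring points on the circle. Higher-dimensional features, such as the loop of the circle itself, need the full machinery that Ripser provides.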

2) Hypothesis testing for TDA

Leader: Jessi Cisewski-Kehe

We will be using R to investigate how to use persistent homology summaries to compare different sets of data. We will explore simple datasets composed of data sampled on circles, and then consider more complicated datasets with web-like spatially complex features (similar to the large-scale structure of the Universe or fibrin networks).

Related talk on Saturday morning: TDA Inference for Spatially Complex Data

Software

R
package TDA, from https://cran.r-project.org/package=TDA
More files are in folder Hypothesis Testing for TDA — Jessi Cisewski-Kehe within the Google Drive folder CBMS TDA Conference 2019 Resources
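
As a rough illustration of comparing two sets of data through a persistent homology summary, here is a minimal Python sketch of a generic two-sample permutation test. The table itself works in R, and this is not necessarily the procedure used there. The function name `permutation_test`, the choice of summary (one number per sample, for example the longest H1 lifetime in its persistence diagram), and the numbers below are all assumptions made for illustration.

```python
import random
from statistics import mean


def permutation_test(group_a, group_b, n_permutations=2000, seed=0):
    """Two-sample permutation test on the absolute difference of means.

    Returns an estimated p-value for the null hypothesis that both groups
    of summary values come from the same distribution.
    """
    rng = random.Random(seed)
    observed = abs(mean(group_a) - mean(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)

    count = 0
    for _ in range(n_permutations):
        # Randomly reassign the pooled values to the two groups.
        rng.shuffle(pooled)
        stat = abs(mean(pooled[:n_a]) - mean(pooled[n_a:]))
        if stat >= observed:
            count += 1
    # Add one to numerator and denominator so the estimate is never exactly 0.
    return (count + 1) / (n_permutations + 1)


# Hypothetical summaries: one value per sampled dataset in each group.
lifetimes_a = [0.91, 0.88, 0.95, 0.90, 0.93, 0.89]
lifetimes_b = [0.52, 0.61, 0.55, 0.58, 0.49, 0.57]
p_value = permutation_test(lifetimes_a, lifetimes_b)
```

A small p-value suggests that the two groups differ in the chosen summary. Richer summaries of a persistence diagram call for a different test statistic, but the shuffling step stays the same.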

3) R package TDA

Leader: Jisu Kim, http://www.stat.cmu.edu/~jisuk/

This table will help participants become familiar with the R package TDA. I will begin with installation and then follow the tutorial so that participants can reproduce its topological data analysis results.

Software

R
package TDA, from https://cran.r-project.org/package=TDA

4) UMAP Development

Leader: Leland McInnes

The primary focus of this table will be getting the latest version of UMAP ready for release. The sprint will focus on implementing features and writing documentation, especially use cases that demonstrate UMAP’s features.
Familiarity with Python is helpful. Documentation will be written using Jupyter notebooks.

Software

Python
Jupyter
JupyterLab
The 0.4dev branch of UMAP (from https://github.com/lmcinnes/umap)

Installation assistance will be provided.

5) Bug hunt for Scikit-TDA

Leader: Nathaniel Saul

There are outstanding issues that could be resolved by someone with basic Python and TDA experience. One interesting open issue is extending the functionality of persistence images.

Software

Python
scikit-tda, which can be installed using the pip command `pip install scikit-tda`

6) Example Generation Hackathon in Scikit-TDA

Leader: Christopher Tralie

The group will work on adding to the examples posted at the Scikit-TDA web site.
The software is written in Python and can be installed via pip; see above.

Software

Python
scikit-tda, which can be installed using the pip command `pip install scikit-tda`

7) Introductory TDA in R with TDAstats

Leader: Raoul Wadhwa

We will begin by going through tutorials on computing and visualizing persistent homology with Vietoris-Rips complexes, and by discussing which input format suits how users think of their data (e.g. a distance matrix for network scientists, a point cloud for topologists). We will then use real datasets (e.g. handwritten digits/letters) to see how TDA can be used to analyze them and perform classification tasks. Users can also bring their own datasets and learn how TDAstats can be used to analyze them effectively. Basic R knowledge would be helpful but is not required; since this is a beginner table, I would be happy to provide R assistance for participants with programming experience in other languages.

Keywords: Vietoris-Rips, simplicial complex, TDAstats, R, beginner

Software

R
RStudio (recommended, but not essential)
TDAstats package: install from within R using the command `install.packages("TDAstats")`
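
To make the two input formats mentioned above concrete, the following is a small Python sketch (not TDAstats code, which is R) that converts a point cloud into the equivalent pairwise distance matrix. The function name `distance_matrix` and the toy square are assumptions made for illustration.

```python
import math


def distance_matrix(points):
    """Return the symmetric matrix of pairwise Euclidean distances."""
    n = len(points)
    return [
        [math.dist(points[i], points[j]) for j in range(n)]
        for i in range(n)
    ]


# Hypothetical point cloud: the four corners of the unit square.
square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
D = distance_matrix(square)
```

A Vietoris-Rips complex depends only on pairwise distances, so the point cloud and its distance matrix carry the same information for this purpose. A distance matrix can also describe data that has no coordinates at all, such as distances between nodes in a network.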

Notes on getting software: Python, R, and beyond

Many of the coding sprints use Python or R or both.

One way to get both Python and R, along with Jupyter notebooks, JupyterLab, and the integrated development environments (IDEs) Spyder for Python and RStudio for R, is to install the free, downloadable Anaconda Python/R distribution from https://www.anaconda.com/distribution/ with versions for macOS, Windows and Linux.

Alternatively, R can be obtained from its home site https://www.r-project.org and the CRAN repository mentioned there. One can then add RStudio, a popular IDE for R with a free downloadable version available from https://www.rstudio.com/products/RStudio/, again for Windows, macOS and Linux.

NSF-CBMS Conference and Software Day on Topological Methods in Machine Learning and Artificial Intelligence

May 13–17 and May 18, 2019. Department of Mathematics, College of Charleston, South Carolina

Coding Sprint Descriptions and Software Links
