Community Data

Covid-19 Data Commons Toolkit

This is a collection of approximately 6000 datasets (after preprocessing) related to COVID-19. The t-SNE plot visualizes the BioBERT embeddings created from the abstracts of the datasets. Several interesting clusters form around keywords such as vaccine and ICU, which we are exploring.


Exploratory Data Analysis

This is a collection of approximately 1250 datasets (after preprocessing) related to COVID-19, available for users to perform exploratory data analysis.


Jaccard and Cosine Scores

Choose any two of the files given below to get their Jaccard and cosine similarity scores.
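For readers who want to see what the two scores measure, here is a minimal sketch in Python. It assumes each file is compared as plain text, split into lowercase word tokens, with cosine similarity computed on raw word counts. The page does not say how the toolkit tokenizes files or whether it compares embeddings instead, so the function names (tokenize, jaccard_score, cosine_score) and the sample strings are illustrative assumptions, not the toolkit's own code.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def jaccard_score(text_a, text_b):
    """Jaccard similarity: shared unique tokens divided by all unique tokens."""
    set_a, set_b = set(tokenize(text_a)), set(tokenize(text_b))
    union = set_a | set_b
    if not union:
        return 0.0  # both texts are empty; define the score as 0
    return len(set_a & set_b) / len(union)


def cosine_score(text_a, text_b):
    """Cosine similarity between the word-count vectors of the two texts."""
    counts_a, counts_b = Counter(tokenize(text_a)), Counter(tokenize(text_b))
    shared = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[token] * counts_b[token] for token in shared)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text has no direction to compare
    return dot / (norm_a * norm_b)


# Hypothetical stand-ins for the contents of two selected files.
first_file = "covid vaccine trial data"
second_file = "covid icu admission data"

print("Jaccard:", jaccard_score(first_file, second_file))
print("Cosine:", cosine_score(first_file, second_file))
```

Jaccard looks only at which words appear in each file, while cosine also weighs how often they appear, so the two scores can differ for the same pair of files.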


Enter your First File: