Covid-19 Data Commons Toolkit
This is a collection of approximately 6000 datasets (after preprocessing) related to covid-19. The T-SNE plot is presented to visualize the BioBert embeddings created using the abstracts of the datasets. There are multiple interesting clusters formed in the dataset related to keywords like vaccine, icu, etc which we are exploring.
Exploratory Data Analysis
This is a collection of approximately 1250 datasets (after preprocessing) related to covid-19. Available for Users to perform Exploratory data analysis.
Jaccard and Cosine Scores
Choose any two of the below given files to get their cosine similarity scores.