Visualising What Diplomats Say

This ongoing project involves mining and analysing the publicly available press releases of the Ministry of External Affairs (MEA), India. The dataset was collected from the official MEA website.

So far, bag-of-words analyses such as topic modelling have been carried out on the dataset. We aim to use language models, and changes in their perplexity, to determine how diplomatic press releases have changed over India's history. We also plan to fit a latent variable model to the resulting time series in order to uncover and account for the language change.
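To make the perplexity idea concrete, here is a minimal sketch, not the project's actual pipeline. It assumes a unigram language model with add-one smoothing and a whitespace tokenizer; the function names and the toy sentences are hypothetical and only illustrate how a model trained on one period can be scored against text from another.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase + whitespace split; real press releases would
    # need a proper tokenizer.
    return text.lower().split()


def train_unigram(texts):
    # Count how often each token occurs in the training texts.
    counts = Counter()
    for text in texts:
        counts.update(tokenize(text))
    return counts


def perplexity(counts, text, vocab_size):
    # Perplexity of `text` under an add-one smoothed unigram model.
    tokens = tokenize(text)
    if not tokens:
        raise ValueError("cannot compute perplexity of an empty text")
    total = sum(counts.values())
    log_prob = 0.0
    for tok in tokens:
        p = (counts[tok] + 1) / (total + vocab_size)
        log_prob += math.log(p)
    return math.exp(-log_prob / len(tokens))


# Hypothetical toy data standing in for two periods of press releases.
earlier = ["india and nepal signed a trade agreement",
           "the minister visited nepal"]
later = ["india condemned the terrorist attack",
         "the minister discussed counter terrorism"]

model = train_unigram(earlier)
vocab = {tok for text in earlier + later for tok in tokenize(text)}
vocab_size = len(vocab) + 1  # one extra slot for unseen words

pp_same = perplexity(model, earlier[0], vocab_size)
pp_later = perplexity(model, later[0], vocab_size)
print(pp_same, pp_later)
```

In this toy setup the later text scores a higher perplexity than the text the model was trained on, which is the kind of signal that could be tracked as a time series across years.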

Mention Frequency of Countries from 1997
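Counts of this kind could be produced along the following lines. This is only a sketch under assumptions: the country list, the `(year, text)` input format, and the function name `count_mentions` are hypothetical, and matching is done on whole words, case-insensitively.

```python
import re
from collections import Counter, defaultdict

# Hypothetical list of country names to look for.
COUNTRIES = ["Nepal", "China", "Pakistan"]


def count_mentions(releases, countries=COUNTRIES):
    # `releases` is assumed to be an iterable of (year, text) pairs.
    patterns = {
        c: re.compile(r"\b" + re.escape(c) + r"\b", re.IGNORECASE)
        for c in countries
    }
    per_year = defaultdict(Counter)
    for year, text in releases:
        for country, pattern in patterns.items():
            per_year[year][country] += len(pattern.findall(text))
    return per_year


# Toy input, invented for illustration only.
releases = [
    (1997, "Talks with Nepal and China were held. Nepal welcomed the visit."),
    (1998, "Pakistan and China were discussed."),
]
mentions = count_mentions(releases)
for year in sorted(mentions):
    print(year, dict(mentions[year]))
```

Exact name matching like this misses adjectival forms and aliases (for example "Chinese" or "PRC"), so a fuller version would need an alias table per country.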

Word Cloud of the most common words