BigGorilla is an open-source data integration and data preparation ecosystem (powered by Python) to enable data scientists to perform integration and analysis of data. BigGorilla consolidates and documents the different steps that are typically taken by data scientists to bring data from different sources into a single database to perform data analysis.  For each of these steps, we document existing technologies and also point to desired technologies that could be developed.  


The different components of BigGorilla are freely available for download and use. Data scientists are encouraged to contribute code, datasets, or examples to BigGorilla.  We hope to promote education and training for aspiring data scientists with the development, documentation, and tools provided through BigGorilla.

We make many decisions on a daily basis. However, it is easy to be sidetracked by urgent needs and short term goals, but fail to attend to activities that contribute to our long-term well-being and happiness. At Megagon Labs, one of our main research projects asks the basic question: can we develop technology that steers people toward behaviors that make them happier?


Our work is inspired by psychology research, especially a field known as Positive Psychology. We are developing "Jo" - an agent that helps you record your daily activities, generalizes from them, and helps you create plans that increase your happiness.  Naturally, this is no easy feat. Jo raises many exciting technical challenges for NLP, chatbot construction, and interface design: how can we build an interface that's useful but not intrusive.

Read more about Jo!

We are also working on creating a research platform that helps psychology researchers take advantage of the advancements in large-scale data collection and natural language processing. We hope the data science techniques Jo develops can be used to drive the state-of-the-art in psychology research. 

HappyDB corpus: 100,000 crowd-sourced happy moments

Screen reader support enabled.