Projects
Doctoral Work
- Aide-memoire: A tool to link issues and pull-requests in an online fashion by predicting which issues (PRs) relate to other PRs (issues). It makes use of a Mondrian Forest model that should be trained on a project before it can make predictions. It is composed of a backend (GitHub Link) and a Chrome plug-in to interface with the backend (GitHub Link)
- POSIT: A tool that makes use of a CRF-biLSTM model to segment and tag text that mixes English and code snippets. It was trained on a combination of C code and StackOverflow. Project Page
- Flexeme: A tool that untangles commits into atomic patches using graph kernel similarity and agglomerative clustering. It was validated on an artificial corpus of tangled commits for 9 C# projects. Project Page
Internship Projects
- Graph-kernel-based detection of anomalous events in spatiotemporal data: anomalies are those points that stay closely together for abnormal lengths of time or disperse suddenly. This work was done as part of an internship at the National Institute of Informatics in Tokyo, JP, under the careful supervision of Asoc. Prof. Mahito Sugiyama.