Clint P. George

clint [at] iitgoa.ac.in
F-9, New Academic Block, IIT Goa

Clint P. George


I am an Assistant Professor in the School of Mathematics and Computer Science at IIT Goa. I was previously a postdoc at the Informatics Institute and Department of Statistics, University of Florida with Dr. George Michailidis. I received Ph.D. and M.S. from the Department of Computer and Information Science and Engineering, University of Florida. Dr. Joseph N. Wilson (CS) and Dr. Hani Doss (Stats) were my Ph.D. advisers. I completed B. Tech. in Computer Science and Engineering from the College of Engineering Thiruvananthapuram.

I work in the area of machine learning and applied statistics, which includes

  • Topic models
  • Empirical Bayes methods
  • Markov chain Monte Carlo (MCMC) and variational inference methods
  • Cancer genomics

During my Ph.D., I collaborated with the UF Data Science Research Lab, Dr. Daisy Wang (CS), and Dr. Bill Hamilton (Law), for the UF Law E-Discovery and SurveyMonkey datamining projects.

I code @clintpgeorge (Github).

Education



Employments



Research


Google Scholar | Scopus ID: 37561278500 | ORCiD ID: 0000-0003-3630-9811 | ResearcherID: D-1846-2011

Analyses of Multi-collection Corpora via Compound Topic Modeling. article arXiv code new
P. George, C., Xia, W., and Michailidis, G. (2019). The Fifth International Conference on Machine Learning, Optimization, and Data Science, Siena – Tuscany, Italy.
Investigating the Usage Patterns of Algebra Nation Tutoring Platform. article new
Akhavan Niaki, S., P. George, C., Xia, W., Michailidis, G., Beal, C.R. (2019). Proceedings of the 9th International Conference on Learning Analytics & Knowledge. Arizona, USA. p. 481-490. ACM.
The Impact of an Online Tutoring Program for Algebra Readiness on Mathematics Achievements; Results of a Randomized Experiment. article new
Akhavan Niaki, S., P. George, C., Xia, W., Michailidis, G., Beal, C.R. (2019). Proceedings of the 9th International Conference on Learning Analytics & Knowledge. Arizona, USA. p. 363-372. ACM.
Principled Selection of Hyperparameters in the Latent Dirichlet Allocation Model. article supplement code
P. George, C. and Doss, H. (2018). Journal of Machine Learning Research.
A Topic-Based Search, Visualization, and Exploration System. article
Grant, C., P. George, C., Kanjilal, V., Nirkhiwale, S., Wang, D. Z., and Wilson, J. N. (2015). FLAIRS-28, Hollywood, Florida, USA.
Latent Dirichlet Allocation: Hyperparameter Selection and Applications to Electronic Discovery. thesis
P. George, C. (2015). Ph. D. Dissertation. University of Florida.
SMART Electronic Legal Discovery via Topic Modeling. article
P. George, C.,  Puri, S., Wang, D. Z., Wilson, J. N., and Hamilton, W. (2014). FLAIRS-27, Pensacola, Florida, USA.
A Machine Learning Based Topic Exploration and Categorization on Surveys. article
P. George, C.,  Wang, D. Z., Wilson, J. N., Epstein, L. M., Garland, P., Suh, A. (2012). ICMLA, Boca Raton, Florida, USA.
Online Topic Modeling for Real-time Twitter Search. article
Grant, C., P. George, C., Jenneisch, C., and Wilson, J. N. (2011). TREC Notebook. NIST. USA.
Topic Learning and Inference Using Dirichlet Allocation Product Partition Models and Hybrid Metropolis Search. technical report
P. George, C., Glenn, T. C., Wilson, J.N., Gader, P. D., Fuentes, C., Gopal, V., and Casella, G. (2011). Technical Report. Computer and Information Science and Engineering, University of Florida.
Dirichlet Allocation Using Product Partition Models. technical report
Fuentes, C., Gopal, V., Casella, G., P. George, C., Glenn, T. C., Wilson, J.N., and Gader, P. D., (2011). Technical Report. Computer and Information Science and Engineering, University of Florida.
Morpheus: A Deep Web Question Answering System. article
Grant, C., P. George, C., Gumbs, J., Wilson, J. N., and Dobbins, P. D. (2010). iiWAS. Paris, France.
A Realm based Question Answering System using Probabilistic Modeling. thesis
P. George, C. (2010). MS Thesis. University of Florida.
Open Source Softwares
Latent Dirichlet Allocation: Markov chain Monte Carlo methods R package
Collapsed Gibbs sampler, grouped Gibbs sampler, serial tempering, hyperparameter selection, etc. (Blei et al. 2003; Griffiths and Steyvers 2004; P. George and Doss 2017)
Latent Dirichlet Allocation: Variational methods R package
Variational expectation maximization (EM) algorithm implementation for the fully Bayesian Latent Dirichlet Allocation (LDA, Blei et al. 2003) model


Teaching and Talks


Teaching
CS 344/386: Artificial Intelligence/Artificial Intelligence Lab (Fall 2019) course page
Indian Institute of Technology Goa.
CS 360: Introduction to Data Science and Machine Learning (Spring 2019) course page
Indian Institute of Technology Goa.
CS 664: Convex Optimization (Spring 2019)
Indian Institute of Technology Goa.
CS 344/386: Artificial Intelligence/Artificial Intelligence Lab (Fall 2018) course page
Indian Institute of Technology Goa.
CS 101: Computer Programming (Summer 2018). course page
Indian Institute of Technology Goa. With Sreejith A.V. and Sharad Sinha
Exploratory Data Analysis (December 2016). slides Jupyter notebook
DSI/Informatics Institute, University of Florida. Workshop Instructor.
COP 3502: Programming Fundamentals for CIS Majors I (Spring 2013).
Computer and Information Science and Engineering, University of Florida. Teaching TA/Lab Instructor.
Guest Lectures and Seminars
Introduction to Deep Learning (March 2017). slides R code
STA 6707: Analysis of Multivariate Data. Department of Statistics, University of Florida. Guest Lectures.
Topic Models for Text Analysis (March 2017). slides
Data Science and Informatics Symposium, University of Florida. Workshop.
Exploratory Data Analysis and Hypothesis Testing (October 2015).
CAP 5771: Introduction to Data Science. Computer and Information Science and Engineering, University of Florida. Guest Lecture.
Introduction to Topic Models (October 2014).
CAP 4773: Projects in Data Science. Computer and Information Science and Engineering, University of Florida. Guest Lecture.
Additive Models and Trees (March 2010). slides
CIS 6930: Elements of Statistical Learning. Department of Statistics, University of Florida. Seminar.
Latent Dirichlet Allocation: Hyperparameter Selection and Applications to Electronic Discovery (October 2015).
Machine Learning Reading Group, Computer and Information Science and Engineering, University of Florida. Seminar.
A Topic-Based Search, Visualization, and Exploration System (May 2015).
FLAIRS-28, Hollywood, FL, USA. Conference Talk.
SMART Electronic Legal Discovery via Topic Modeling (May 2014).
FLAIRS-27, Pensacola, FL, USA. Conference Talk.
A Machine Learning Based Topic Exploration and Categorization on Surveys (December 2012).
ICMLA-2012, Boca Raton, FL, USA. Conference Talk.