Arjun Mukherjee

Ph.D. Candidate
Department of Computer Science
University of Illinois at Chicago
851 S. Morgan St. Chicago, IL 60607


Fundamentals


I am a Ph.D. candidate in the Department of Computer Science at the University of Illinois at Chicago. I work with Prof. Bing Liu. My research interests include Bayesian Inference, Computational Linguistics, Sentiment Analysis, Opinion Spam, and Web Mining.


Current Research

My research focuses on mining actionable knowledge from social media. While there exist a myriad of directions in mining various information from social media, I am particularly interested in developing models and algorithms for predicting socio-psychological indices ranging from gender [Mukherjee/Liu/10], communication tolerance [Mukherjee/etal/13b], opinion spam [Mukherjee/etal/12; Mukherjee/etal/13a; Fei/etal/13; Mukherjee/etal/13c], contentions on viewpoints [Mukherjee/Liu/12a], socio-economic indices [Si/etal/13] etc. directly from unstructured social media. As a result, my research hinges on techniques from computational linguistics, Bayesian inference, and unsupervised learning. I am also interested in semi-supervised learning and inducing domain knowledge [Mukherjee/Liu/12b ; Chen/etal/13a; Chen/etal/13b ; Chen/etal/13c] for topic modeling, text and opinion mining applications.


Publications


[Chen/etal/13c]

Zhiyuan Chen, Arjun Mukherjee, Bing Liu, Meichun Hsu, Malu Castellanos, and Riddhiman Ghosh. Exploiting Domain Knowledge in Aspect Extraction. Accepted for Oral Presentation. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'13). October 18–21, 2013 — Seattle, USA.
[Paper]

[Chen/etal/13b]

Zhiyuan Chen, Arjun Mukherjee, Bing Liu, Meichun Hsu, Malu Castellanos, and Riddhiman Ghosh. Discovering Coherent Topics using General Knowledge. Proceedings of the ACM Conference of Information and Knowledge Management (CIKM'13). October 27 - November1, Burlingame, CA, USA.
[Paper]

[Mukherjee/etal/13c]

Arjun Mukherjee, Abhinav Kumar, Bing Liu, Junhui Wang, Meichun Hsu, Malu Castellanos, and Riddhiman Ghosh. Spotting Opinion Spammers using Behavioral Footprints. Proceedings of the 19th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'13). August 11-14, Chicago, USA.
[Paper]

[Si/etal/13]

Jianfeng Si, Arjun Mukherjee, Bing Liu, Qing Li, and Huayi Li. Exploiting Topic based Twitter Sentiment for Stock Prediction. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL'13). August 4-9, Sofia, Bulgaria.
[Paper]

[Mukherjee/etal/13b]

Arjun Mukherjee, Vivek Venkataraman, Bing Liu, and Sharon Meraz. Public Dialogue: Analysis of Tolerance in Online Discussions. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL'13). August 4-9, Sofia, Bulgaria.
[Paper]
This work was featured and aired on RTR FM. See podcast in RTR FM 92.1, or iTunes.

[Mukherjee/Liu/13]

Arjun Mukherjee and Bing Liu. Discovering User Interactions in Ideological Discussions. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL'13). August 4-9, Sofia, Bulgaria.
[Paper]

[Chen/etal/13a]

Zhiyuan Chen, Arjun Mukherjee, Bing Liu, Meichun Hsu, Malu Castellanos, and Riddhiman Ghosh. Leveraging Multi-Domain Prior Knowledge in Topic Models. Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI'13). August 3-9, 2013, Beijing, China.
[Paper]

[Fei/etal/13]

Geli Fei, Arjun Mukherjee, Bing Liu, Meichun Hsu, Malu Castellanos, and Riddhiman Ghosh. Exploiting Burstiness in Reviews for Review Spammer Detection. Proceedings of the 7th International AAAI Conference on Weblogs and Social Media (ICWSM'13). July 8-10, 2013, Boston, USA.
[Paper]

[Mukherjee/etal/13a]

Arjun Mukherjee, Vivek Venkataraman, Bing Liu, and Natalie Glance. What Yelp Fake Review Filter might be Doing? Proceedings of the 7th International AAAI Conference on Weblogs and Social Media (ICWSM'13). July 8-10, 2013, Boston, USA.
[Paper]
Detailed notes about traditional lies vs. fake reviews can be found in this addendum.
This work was featured in Crowdresearch.org. See blog post here.
A more detailed analysis appears in the following Technical Report:
Fake Review Detection: Classification and Analysis of Real and Pseudo Reviews. UIC-CS-2013-03.
[TR]

[Mukherjee/Liu/12d]

Arjun Mukherjee and Bing Liu. Analysis of Linguistic Style Accommodation in Online Debates. In Proceedings of the 24th International Conference on Computational Linguistics (COLING'12). December 8-15, 2012, Mumbai, India.
[Paper]

[Mukherjee/Liu/12a]

Arjun Mukherjee and Bing Liu. Mining Contentions from Discussions and Debates. In Proceedings of the 18th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'12). August 12-16, Beijing, China.
[Paper] [Slides] [Poster]

[Mukherjee/etal/12]

Arjun Mukherjee, Bing Liu, and Natalie Glance. Spotting Fake Reviewer Groups in Consumer Reviews. In Proceedings of the ACM International World Wide Web Conference (WWW'12). April 16-20, 2012, Lyon, France.
[Paper] [Slides] This work was featured in ACM Tech News, The Register, CNet, Mashable Tech, and many others. See other Media Coverage.

[Mukherjee/Liu/12b]

Arjun Mukherjee and Bing Liu. Aspect Extraction through Semi-Supervised Modeling. In the Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL'12). July 9-12, 2012, Jeju, Korea.
[Paper] [Slides]

[Mukherjee/Liu/12c]

Arjun Mukherjee and Bing Liu. Modeling Review Comments. In the Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL'12). July 9-12, 2012, Jeju, Korea.
[Paper] [Slides]

[Mukherjee/etal/11]

Arjun Mukherjee, Bing Liu, Junhui Wang, Natalie Glance, Nitin Jindal. Detecting Group Review Spam. In Proceedings of the International World Wide Web Conference (WWW'11), Poster. Mar 28-Apr 1, 2011, Hyderabad, India.
[Poster]

[Mukherjee/Liu/10]

Arjun Mukherjee and Bing Liu. Improving Gender Classification of Blog Authors. In Proceedings of the conference on Empirical Methods in Natural Language Processing (EMNLP'10). Oct. 9-11, 2010, MIT, Massachusetts, USA.
[Paper] [Slides]

External Bibliography Databases

DBLP

Google Scholar


Awards and Honors

Dean's Scholar Award , 2013
Chancellor's Graduate Research Fellowship (Two year in a row, 2013, 2012)
NSF SoCS Doctoral Symposium Scholarship, 2013
Facebook Ph.D. Fellowship Finalist, 2013
ACL Travel Award, 2013
AAAI-ICWSM Travel Award, 2013
EMNLP-CoNLL Best Reviewer Award, 2012
UIC Provost's & Deiss Awards for Graduate Research, 2012
UIC Graduate College Student Presenter Award, 2012
UIC Graduate Student Council Travel Award, 2012
KDD NSF Student Travel Award, 2012
UIC Graduate College Student Presenter Award, 2010
UIC Graduate Student Council Travel Award, 2010


Professional Service (PC Member/Reviewing)

Conferences:
International World Wide Web Conference (WWW 2014).
International World Wide Web Conference (WWW 2013).
Conference on Empirical Methods in Natural Language Processing (EMNLP 2013).
International Joint Conference on Natural Language Processing (IJCNLP 2013).
Conference on Empirical Methods in Natural Language Processing (EMNLP-CoNLL 2012).
International Conference on Web Search and Data Mining (WSDM 2013) (external reviewer).
International Conference on Web Search and Data Mining (WSDM 2012) (external reviewer).

Journals:
Data Mining and Knowledge Discovery Journal (DMKD), Springer Computer Science Journals.
World Wide Web Journal (WWWJ), Springer Computer Science Journals.
Knowledge and Information Systems (KAIS), Springer Computer Science Journals.
ACM Transactions on Asian Language Information Processing (TALIP)