Dr. Yang Wang

February 24, 2017March 7, 2017

Data Science Meetup in El Paso

I am looking to start a data science Meetup in El Paso. If you or your friends have interest in data analytics, big data, business intelligence, programming, digital marketing, etc. check out the Meetup group. The idea, hopefully, is to get both academic/UTEP affiliated people and local industry people to participate so that we can … Continue reading Data Science Meetup in El Paso

February 15, 2017February 15, 2017

Automatic Neighborhood Detection

Data clustering is one of the most fundamental components of the machine learning toolkit. Many of these algorithms are easily implemented in Python, making it the go to language for data scientists when implementing canned routines. At the PyData conference last fall, I attended this talk by Leland McInnes & John Healy which introduced the HDBSCAN … Continue reading Automatic Neighborhood Detection

August 29, 2016August 17, 2018

Revised and Better?

In the spirit of disseminating our latest edition of the manager response to online reviews paper, Alex and I have posted the current manuscript to SSRN for those who are interested. Let us know your thoughts via email. http://papers.ssrn.com/sol3/papers.cfm?abstract_id=2831402 Abstract: This manuscript investigates the externalities of managers’ responses (MR) to online reviews on popular travel … Continue reading Revised and Better?

September 2, 2015February 15, 2017

Basic text analysis tutorial – string manipulations, basic sentiment analysis

The previous series of blogs documented how causal inference arguments (difference in differences, regression discontinuity design, and natural experiments) are applied in big data settings within the online reviews domain. This domain also happens to be a great setting to quantitatively analyze textual data. We have a corpus of nearly 18 million reviews for hotels … Continue reading Basic text analysis tutorial – string manipulations, basic sentiment analysis

July 5, 2015August 11, 2015

Fun with TripAdvisor maps!

Last time, I stopped with a preview of a TripAdvisor data on an animated gif map of the US. Here it is again: (Click to enlarge) I think the patterns in the timing of reviews (which I assume to approximate when people travel) are quite subtle, yet obvious. But when the obvious is visualized, it is generally more … Continue reading Fun with TripAdvisor maps!

June 27, 2015September 2, 2015

Some empirical tidbits from TripAdvisor

My paper with Alex Chaudhry on management response to online reviews draws heavily upon crawled data from TripAdvisor. We began collecting data looking at Las Vegas hotels, a good starting source for sampling travelers from around the world. Everyone’s been to Vegas, and Vegas guests have been literally everywhere: We created the above plot by plotting … Continue reading Some empirical tidbits from TripAdvisor