The previous series of blogs documented how causal inference arguments (difference in differences, regression discontinuity design, and natural experiments) are applied in big data settings within the online reviews domain. This domain also happens to be a great setting to quantitatively analyze textual data. We have a corpus of nearly 18 million reviews for hotels …
Continue reading Basic text analysis tutorial – string manipulations, basic sentiment analysis
Last time, I stopped with a preview of a TripAdvisor data on an animated gif map of the US. Here it is again: (Click to enlarge) I think the patterns in the timing of reviews (which I assume to approximate when people travel) are quite subtle, yet obvious. But when the obvious is visualized, it is generally more … Continue reading Fun with TripAdvisor maps!
My paper with Alex Chaudhry on management response to online reviews draws heavily upon crawled data from TripAdvisor. We began collecting data looking at Las Vegas hotels, a good starting source for sampling travelers from around the world. Everyone’s been to Vegas, and Vegas guests have been literally everywhere: We created the above plot by plotting … Continue reading Some empirical tidbits from TripAdvisor