Basic text analysis tutorial – string manipulations, basic sentiment analysis

The previous series of blog posts documented how causal inference arguments (difference in differences, regression discontinuity design, and natural experiments) are applied in big data settings within the online reviews domain. This domain also happens to be a great setting for quantitative analysis of textual data. We have a corpus of nearly 18 million reviews for hotels …

Some empirical tidbits from TripAdvisor

My paper with Alex Chaudhry on management response to online reviews draws heavily on crawled data from TripAdvisor. We began by collecting data on Las Vegas hotels, a good starting point for sampling travelers from around the world. Everyone’s been to Vegas, and Vegas guests have been literally everywhere: We created the above plot by plotting …