Materials are now hosted on my Github
All the meet ups
- 2/19/2019 Identifying hotel renovations through machine learning
- 11/7/2018 Data Science at Lyft ft. Balaji Chandrasekhar
- 5/31/2018 Do online reviews improve hotel quality, causal inference with big data
- 4/26/2018 Modeling time series data with state space models ft. Jing Chen
- 3/29/2018 Version control and software development in R, ft. Justin Strate
- 2/22/2018 Data Demo Day: dashboarding at Virgin Galactic (Marco), NLP techniques (Yang)
- 11/29/17 Intro to Python for Data Analysis
- Python Intro
- Photo Geotag Example (time permitting)
- 10/25/2017 R Worskhop (Justin Strate)
- 10/4/2017 Web Scraping Workshop
Server Access(password TBA)- ESPN Example
- IMDB Example
- Links to tools
- Anaconda (for Python / jupyter notebook environment)
- Star Cluster (for server cluster on AWS and parallel computing)
- 9/30/2017 Data Jam (house price Kaggle competition)
Server Access(password TBA)- There’s a notebook in the code folder, please make a copy and play in your own kernel.
- 9/13/2017 Data Demo Day (CoBA 307)
- 8/23/2017 Data Demo Day (CoBA 307)
- Justin Strate: Bayesian Statistics
- Ivan Montes: Customer/Employee Satisfaction Analysis
- Dr. Suman Sirimulla: ML applications in drug discovery
- 6/14/2017 Data Demo Day (CoBA 307)
- Causal inference strategies in secondary data analysis.
- Code (iPython Notebook) Notes version (HTML – download and open to render math)
- 5/17/2017 Data Demo Day (CoBA 307)
- Estimating Yelp reviewer locations based on reviews (Manoj Saha)
- Predicting daily fantasy baseball outcomes (Carlos Beas)
- 5/4/2017 NLP workshop (CALC LAB)
- Code/Data (warning, big folder! ~ 10GB, you can download files separately): NLP dropbox, The script for Vegas works 100%, you will need to download the data + code folder and create the intermediate directory locally. Whole thing takes about 3-5 hours to run, change all the “if 0 == 1” blocks to “1 == 1” to run.
- 4/23/2017 Data Jam – Kaggle Competition – Predicting Property Rental Interest (meet at Hillside Coffee 11am)
- Kaggle Competition Link (2 Sigma / Rent-hop)
- Data Dropbox
- This data jam will be tool agnostic. However, I will be using Jupyter Notebook w/ Python. There is already a notebook uploaded which can serve as a starting point/baseline prediction.
- Please come with your laptop, try to install all your necessary tools beforehand.
- I will try to reserve an AWS server that you can access via Jupyter Notebook so you can run code in the cloud.
- Server address
http://54.68.227.214:8888/ - password: datascience!
- Server address
- Kaggle zipped contents (code – half finished)
- 4/13/2017 (UTEP CoBA 310 – CALC LAB #2)
- Set up an EC2 Instance
- Set up StarCluster
- iPyParallel with starcluster
- Notes:
- Choose the appropriate region
- Create ssh key for that region
- Modify the AMI, save it
- Change the default AMI to the modified one
- Python Demo of Web Scraping (rough code, will update)
- 3/30/2017 (UTEP CoBA 310 – Calc Lab #2) Intro to Python 🐍 Workshop
- Download Anaconda
Temporary AWS Server(go here to follow along w/o downloading)- Password: datascience!
- Files