You gain wisdom only when you look back, so think regressively more!
Data Scientist @ AccuWeather
PhD Student @ Penn State University
Tracy Shen is currently a Data Scientist at AccuWeather and PhD candidate for machine learning at Penn State University. She is also a research assistant at the Computational and Spatial Analysis Core of the Population Research Center(Nation’s top 3) at Penn State. There she manages 4TB geotagged twitter database and provides twitter analysis for the university grant writing leveraging Pittsburgh Super Computer Center and Hadoop and Spark environments. Her research interest is in computational social science utilizing machine learning, natural language processing and network analysis methodology. She’s affiliated to PIKE lab at the School of Information Sciences and Technology at Penn State, working on the SysFake project to detect fake news user susceptibility. Please see her full CV here.
Google BigQuery and Azure Cloud are both powerful platforms to store data. Google BigQuery can process a couple TB of data within a couple minutes and you pay when you query, store and process. The detailed pricing is here.Azure SQL data base provides fast and convienient data for the first 32 GB/month at ~$5/month. The detailed pricing is here.Normally business intelligence analysts, database managers or data scientists access the two platforms from the two consoles separately.
Recently, I’ve been playing with the deep learning python package ‘tensorflow’. I ran a simple linear regression model and had some success. Tensorflow is great with unstructured data and image recognization problem. Therefore, it usually runs better in a GPU supported computer. However, given my model is rather simple and won’t need to rely on too much image processing power like GPU. I did it on my windows 7 professional/10 machine and it predicted some values for me.
If you have viewed my bio, you probably noticed that I don’t have a science or engineering background, so how come I end up with data scientist? Well, it turns out that you don’t need a science or engineering background to become a data scientist! Here’s my personal trajectory. I think it’s highly reproducible:)
In Sep 2014, I was hired as a marketing research analyst by AccuWeather to work on AW’s new adventure: IoT or we call it emerging platform projects.