The goal of this project was to find a correlation between the price of stocks and google trends.
I would need to figure out a way to programatically enter a stock's ticker symbol and search it on Google Trends real-time. Then, I would need a way to extract and store the data. Lucky for me I used pytrends to make requests to google's servers to get my data.
I would compare the current real-time search trends of stocks to other stocks! I would find the outliers and display them on a table that is updated every few seconds.
I theorize if many people are researching about a company it is worth looking into. Maybe I could make a few trades myself (although I am not interested in that). I just want to maybe find a correlation between the search results (popularity) and a stock's intraday price.
- There are about 2000 companies I would like to constantly scan (search on google trends) real-time. Because of the mass amount of requests I used python threading to speed up this process.
- Google doesn't like when people are bombarding their servers with mutliple requests. Especially when thousands of requests are coming from one person or internet address. So, my IP address would frequently become blocked, and I would not be able to make requests. Solution: rotate my IP address with NordVPN, when needed. To do this in my program I used a module someone else created called NordVPN Switcher.
- Bad Data: Sometimes the data pulled from google trends did not say much about the popularity of the stock, but it peaked in my program's interest. In other words, the data fooled my method outlier detections. To fix this I created a function to detect and eliminate spiked data from the pulled data.