Now we’ve got the data into a MySQL database, so next step is accessing it from our prefered application (sometimes, that means MS Excel, i.e.). So, let’s go:
1) Make sure you system is up to date (specially the Jet Engine).
2) Download and install the last MySQL ODBC Connector.
3) Go to Start_Settings_Control Panel_32bit ODBC and create a new DNS (choose MySQL driver).
Once finished, you’ll be able to access the data from MS Excel. However, you have to take into account you can’t view the data in the spreadsheet since the volume of rows is huge. The good thing is you’ll be able to calculate the statistics from there, or get data from pivot table, etc.
Similar Posts:
Yesterday I was talking to my friend and colleague Pau Rausell-Köster, from the Research Unit in Cultural Economics (Universitat de València), about the Netflix Prize. We were discussing about the foundations of taste and preferences, and how it was quite difficult to, by means of a devil reductionism, create a mathematical model that could predict how you’re going to rate a movie. The question was: it works!.
This conversation though led to another mathematical model it’s been used for a while by a company called Polyphonic HMI S.L. to predict if a song will be successful (aka “a HIT”). They use a methodology they have named as “Hit Song Science”, which basically uses “Spectral Decomposition” to get different musical attributes for all the songs they have analysed (3.5 million to date). They, they apply clustering techniques to the songs that have been a success (aka “a HIT”) in the last 5 years (I imagine, the time-frame is just to take out the trends and account for changes/evolution in people’s preferences). Then, they are able to predict if a new song will succeed in the market and they asign a rating (controlling type-I error).
There’s only a downsize: would the record companies invest in promoting songs with low rating?. This would affect the song to the extent of not helping it to become a hit, so, again our beloved maths would be changing the course of events and distorting the model by means of the feedback in flawed data (the reverse, type-II error, could as well happen, bad songs evaluated as possible hits being highly promoted and succeeding). Moreover, if this happens to be in a big scale, innovation in music creation is aborted…, unless… you’re brave and forget the model!.
PS For the Netflix Prize Teams: food for thought.
Similar Posts:
Get a group of people into an office for a 3-day weekend and they would come with a great idea for creating a company. Well, sorry, they’ll create a company. Which one is the latest created company by means of this methodology?: VOSNAP.
They define this web application (of course, is a dotcom) as a "social quick voting tool that reduces group think and time wasted on decision-making by giving you the ability to receive a quick vote from a group of friends on any decision. Users can create a poll, send it to their friends to quickly vote on via text message or email, and receive the results instantly". So, a dictatorship of the masses?, is this prefered to the dictatorship of the leaders?, or is it better a technocracy?. Actually, I didn’t want to convert this post into a political one, but discuss about the ways a decision can be taken, if the application (VOSP) was really breakthrough, and if the Startup Weekend was really effective. However, as always, I’ve ended up changing mood after quite a few words and just letting you finish the quarrel (in Spain we say: "tirar la piedra y esconder la mano", this is, "throw the stone and hide the hand", ringing the bell and running like hell!).
Similar Posts: