Mims's Bits

AI That Picks Stocks Better Than the Pros

A computer science professor uses textual analysis of articles to beat the market.

Christopher Mims 06/10/2010

  • 14 Comments

The ability to predict the stock market is, as any Wall Street quantitative trader (or quant) will tell you, a license to print money. So it should be of no small interest to anyone who likes money that a new system that works in a radically different way than previous automated trading schemes appears to be able to beat Wall Street's best quantitative mutual funds at their own game.

It's called the Arizona Financial Text system, or AZFinText, and it works by ingesting large quantities of financial news stories (in initial tests, from Yahoo Finance) along with minute-by-minute stock price data, and then using the former to figure out how to predict the latter. Then it buys, or shorts, every stock it believes will move more than 1% of its current price in the next 20 minutes - and it never holds a stock for longer.

The system was developed by Robert P. Schumaker of Iona College in New Rochelle and and Hsinchun Chen of the University of Arizona, and was first described in a paper published early this year. Both researchers continue to experiment with and enhance the system - more on that below.

Using data from five non-consecutive weeks in 2005, a period chosen for its lack of unusual stock market activity, here's how AZFinText performed versus funds that traded in the same securities (which were all chosen from the S&P 500):

And here's how it performed compared to the top 10 quantitative mutual funds in the world, all of which draw from a much larger basket of securities, except of course for the included S&P 500 itself:

Software that analyzes textual financial information - quarterly reports, press releases, news articles - is nothing new. Researchers have been publishing on the subject since at least the mid-1990's.

However, previous approaches to this technique were hampered by either poor performance (averaging little better than chance) and / or requirements for unreasonable amounts of computational horsepower. Schumaker and Chen get around these issues by first radically shrinking the amount of text their system has to parse by boiling down all the financial articles the system ingests into words falling into specific categories of information.

Interestingly, these techniques and categories derive from classification schemes described at the 7th Message Understanding Conference, held in 1997, which was a Defense Advanced Research Projects Agency project to create new and better ways to extract information and meaning from texts. (At the time, they were concentrating on terrorist activities in Latin America, airplane crashes, rocket and missile launches and other things relevant to national security.)

Schumaker and Chen's system concentrates on Proper Nouns - people and companies - and combines information about their frequency with stock prices at the moment a news article is released. Using a machine learning algorithm on historical data, they look for correlations that can be used to predict future stock prices.

Further work with the AZFinText system has revealed oddities that may or may not remain relevant as researchers continue to apply it to other bodies of historical stock market and financial news data. For example, in a paper described on June 6 at the Computational Linguistics in a World of Social Media workshop, Schumaker went fishing for the Verbs most likely to cause a stock to move up or down in the next 20 minutes, and came up with a list of 211 terms that had some power to move stock prices. (In his work, 'verb' is a technical term, and does not exactly correspond with the conventional definition of the word.)

According to Schumaker:

The five verbs with highest negative impact on stock price are hereto, comparable, charge, summit and green. If the verb hereto were to appear in a financial article, AZFinText would discount the price by $0.0029. While this movement may not appear to be much, the continued usage of negative verbs is additive.

The five verbs with the highest positive impact on stock prices are planted, announcing, front, smaller and crude.

Schumaker did not attempt to determine why these particular terms move stock prices, but it's interesting to note that the stock market does not appear to like the marketing buzzword "green," but is quite happy to hear any news at all about the term "crude," as in oil.

Print

Close Comments

To comment, please sign in or register

Forgot my password

kbillet

59 Comments

  • 609 Days Ago
  • 06/10/2010

Hum, Sounds Like a Plan!

Maybe we need to create some news articles for it to find with lots of these keywords.
I've got to question the ethics of using such software?  Doesn't add any value to the Markets.  Just the opposite.  It sucks value from the Markets with little to no risk.  I wonder what the difference is between using this to make money and a Ponzy Scheme.  Think about this next time you "Gamble" in the market. That's right, it use to be investing.  Greed does strange things to mankind.

Reply

David Dennis

2 Comments

  • 605 Days Ago
  • 06/14/2010

Re: Hum, Sounds Like a Plan!

This kind of software is so easy to write, at least in theory, that there would be no way to ban it even if it was truly immoral. And I see no evidence that it is. 

I don't see it as being anything like a ponzi scheme - that's where someone takes investments promising huge returns,  and gives early investors great returns by paying them with the money raised later. Eventually it collapses because not everyone can get paid.

This is just a way of doing dispassionate investment analysis. The isolation of emotion is why it can work.

However I am not sure if this is a statistically significant difference. Let's see results over a year and then we know if we have something.

D

Reply

devassocx

110 Comments

  • 608 Days Ago
  • 06/11/2010

picking stocks

any company that understood the rules that this
software uses could easily tailor their own press releases to use the 'hot' terms even for releases that contained bad news and thereby defeating the
system.

I tend to agree with another poster that stuff like
this should be illegal.

Reply

mattgroom

289 Comments

  • 608 Days Ago
  • 06/11/2010

transfers

The data shows that it was used as a part of a fund, ie if you as an individual invested in that fund you'd get a 8.5% gain on your money.

We wont see a removal of the electronic transfer of commodities...But im sure you could build a similar system yourself.

Reply

mkogrady

423 Comments

  • 608 Days Ago
  • 06/11/2010

Re: transfers

I'll still use a professional money management team to handle my long term investments. However, if this code "found" its way on to the Internet and some day-trader could get a copy to play around with, I would entertain the idea of pulling some money from my Savings account that pays a measley interest rate and apply this tool to see if I could pull in 8% or better.

Like Vegas though - pull your winnings off the table as fast as you can.

Reply

StupidPeasant

98 Comments

  • 608 Days Ago
  • 06/11/2010

can't be stopped

To make it illegal is to give it only to the powerful few; so that is what will happen. Then that will be gotten around. The singularity is coming, like it or not. The whole concept of trading stock the way it is will crash someday, but not yet.  The transition period will be messy and unfair. First, programs will be written to produce millions of varied stories. More programs will be written to counter it. They will be written at light speed by computers. The humans will need to step back. Trading stock has no real value anyway. Gathering funds to bring an idea to market should be the only purpose of stock. How do we, the scum, make money in the mean time is the question.

Reply

akousari

1 Comment

  • 608 Days Ago
  • 06/11/2010

another dumb system

I have seen so many magical systems that promise it all. The first problem is that the experiment cannot be conclusive based on 5 weeks of data. Second we shouldn't compare apples with oranges because mutual funds have longer time perspectives. Third these systems work very well for very short periods of time and they usually get beaten by the market. Lsst but not least as more people use the system the opportunities offer by the systems diminishes over time and experienced traders will act against it.

Reply

Advertisement

bhumphus

1 Comment

  • 608 Days Ago
  • 06/11/2010

AI That Picks Stocks Better

Sorry guys, but I wrote a BASIC program that did this in 1983. True, I had to hand input the info into the original software but by the late 1990s, this worked in VB with automatic inclusion of the data. I have used it to benefit me for the last 30 years and can't understand why this idea is such an important issue today. Picking the correct parameters just takes a bit of research and testing. I even wrote my dissertation based on the concept! Have fun.

Reply

mikedu

19 Comments

  • 608 Days Ago
  • 06/11/2010

He should test this in Asian stocks like China or the Philippines

If its really reliable, and counting that it filters   false financial news aimed at subverting accuracy, he should try out Asian stocks like China or the Philippines. 8.5% fluctuations in short to medium term trading seems more of the norm here, at least for now.

Reply

atrev

1 Comment

  • 607 Days Ago
  • 06/12/2010

where to get that software?

Reply

anonymole

4 Comments

  • 602 Days Ago
  • 06/17/2010

Public means useless

The very fact that this trading technique has been exposed means that it no longer works. Bottom line? No one explains valid methods of how to make money if that method will actually make THEM money. Once a method loses its efficacy they then take them and try to sell them to the public as viable. Traders who make money are those traders you will never hear about. Period.

Reply

PringnirP

1 Comment

  • 601 Days Ago
  • 06/18/2010

Probabilities will change

The system will need to constantly scan the news looking for the newest key words, as these will change daily.

If the system is incorporated into a Fund, then other traders will simply follow that fund's actions and negate any advantage the Fund had.

Reply

rizwan79

1 Comment

  • 567 Days Ago
  • 07/22/2010

retirement benefits

it seems that retirement benefits will go as a person takes a premature retirement.http://www.blogofstocks.com

Reply

jgerstle

4 Comments

  • 546 Days Ago
  • 08/12/2010

Engineered B.S.

The idea of using the past to predict the present for stock prices using "side" information - textual analysis of news or otherwise, seems plausible in theory.  Basically it's just another form of logistical regression modeling. If the input factors yield a high degree of explanatory power for the model's outputs, then it is reasonable for predicting an incremental next-step along the stock price trajectory. 

However, every paper I've ever read about doing this type of thing amounts to little more than an academic excercise.  Generally, the examples are either highly engineered to produce a desired outcome (data mined) or are so trivial that to find enough examples occuring with any frequency in the real world would be another exercies all to itself.

Finally, I am willing to bet that the authors of this system didn't have any significant amounts of money invested in the model and/or have not accounted for transaction costs, which may also vary significantly over time given different market environments. 

In the real world, the other difficulty in making this a profitable algorithm would be in taking large enough positions in each stock without eviscerating any potential gains to be had.  I think others have mentioned this point too - if I am using this algorithm and it is easily discovered which news feeds to look at and which keywords result in predictive outcomes, then as another market participant I will just do the same.  If there are enough partipants all doing the same thing, then there will be no informational advantage and these short lived gains will be competed away.

Reply

Bio

Christopher Mims is a journalist who covers technology and science for just about everybody.

Subscribe to the Mims's Bits RSS Feed

Advertisement
Advertisement

Facebook

Advertisement