Jonathan Lenaghan - Data Scientists at Work

Database Reference

In-Depth Information

mathematical consistency?” My wife happens to work in the same field, but

on the experimental side. She was working at accelerators while I was off in

the academic clouds.

After I left academics, I worked for an enjoyable couple of years as an assistant

editor for the Physical Review in Brookhaven. It was very interesting to see how

the review and referee process works, and how the sausage is made in the

academic publishing world. It set me to thinking a lot more about computation

and data. And I was in the vicinity of New York and the financial industry.

Going to work in the quantitative finance industry was definitely my first jump

into a very data-intensive world. Quantitative finance broadly divides into two

fields. In the first field, people work with very complicated financial instru-

ments, such as over-the-counter derivatives, which are traded very simply,

usually over the phone. There are mathematical frameworks around these

complicated instruments, which have theoretical, analytical, and computational

aspects. The second field is high-frequency algorithmic trading, in which you

have very simple financial instruments, such as equities, which are traded in

very complicated ways. This field is much more computational and algorithm-

driven. There's no analytical structure around this type of trading; it's essen-

tially driven by experimental data. I originally wanted to go into quantitative

finance to work in the first field dominated by analysis of complicated deriva-

tive products, but I ended up in the second field dominated by algorithmic

trading of simple financial instruments.

Gutierrez: What made you change your plan from doing purely analytical

work to data-driven work?

Lenaghan: I quickly learned that the empirical style of work in algorithmic

trading suited me very well. I liked the experimental design—figuring out what

works, what doesn't work, and how not to trick yourself with data. There are

lots of challenges working with data in the algorithmic world. One of them is

that thousands and thousands of other people are looking at the exact same

data sets, and basically they're all just squeezing everything they possibly can

out of it. Another challenge is that people under pressure to find patterns

are prone to fall into the common human fallacies of overfitting models with

insufficient data and overreading correlation as causation.

Gutierrez: How do the data challenges you faced in the algorithmic trading

world compare to the data challenges you face at PlaceIQ?

Lenaghan: The initial data challenge when I came to PlaceIQ was that geospa-

tial data was a data type that I had never worked with. The second challenge

was that the data volume was scaled up by a couple of orders of magnitude.

The volume of data in the algorithmic trading I was doing was quite large—say,

a terabyte a year. But the PlaceIQ environment generates hundreds and hun-

dreds of terabytes a year. Making these adjustments were exciting challenges

Search WWH ::

Custom Search

Home