Episodes
Audio
Chapters (AI generated)
Speakers
Transcript
Amanda Cox on Working With R, NYT Projects, Favorite Data
Data. Datastores is supported by Tableau software, helping people see and understand their data. Get answers from interactive dashboards. Wherever you go for a free trial, visit Tableau. Datastories.
Amanda CoxI would give two of my left fingers for this. Data.
Moritz StefanerDatastores is supported by Tableau software, helping people see and understand their data. Get answers from interactive dashboards. Wherever you go for your free trial, visit Tableau software@Tableau.com. Datastories. That's Tableau.com Datastories.
Summer in Germany AI generated chapter summary:
Enrico: I've heard you're having better summer in Germany than in New York. The weather is real bad. Perfect weather to record a podcast inside, right? It's perfect. Any news? No, not really. Pretty flat and boring.
Enrico BertiniHi, everyone. Data stories number 56. Hey, Moritz.
Moritz StefanerHey, Enrico. How are you doing?
Enrico BertiniHow's it going? I've heard you're having better summer in Germany than in New York.
Moritz StefanerStarting with the weather again. It's a strong opening. Yeah, the weather is real bad. It's horrible. I'm wearing a scarf. That's not.
Enrico BertiniYeah, no, we don't have a scarf issue here.
Moritz StefanerNo.
Enrico BertiniBut, yeah. Not very sunny, though. Yeah. What can I do?
Moritz StefanerPerfect weather to record a podcast inside, right?
Enrico BertiniIt's perfect. Yeah. Yeah.
Moritz StefanerWhat have you been up to? Any news?
Enrico BertiniNo, not really. Pretty flat and boring. No, I'm doing some work, but no. No major news, no scandals.
Moritz StefanerNo excitement.
Enrico BertiniNo scandals. Let me think.
Moritz StefanerYeah, same here. I'm just holding up, trying to progress with a project.
Enrico BertiniYeah, it's that part of the year.
Moritz StefanerIt's like that, the pre vacation finishing up time.
Enrico BertiniYeah, I'm not sure when I'm gonna have vacation, but. Yeah, fine, fine. Don't complain.
Moritz StefanerMaybe.
Enrico BertiniWell, my family is leaving soon to Italy. I'm so jealous.
Moritz StefanerYeah, but then maybe you can finally get some work done. Who knows?
Enrico BertiniYeah. I don't know. Yeah. There is this whole theory that you can do more work when you have a family rather than less.
Moritz StefanerTotally.
Enrico BertiniYou have more motivation. Maybe. I don't know. Or maybe it's just bullshit.
Moritz StefanerCould be. Anyways, I would say, as we don't have much to say, let's just bring our guest on. So today we have a super special guest, and it's Amanda Cox. Hi, Amanda.
Amanda Cox On The Ellen Interview AI generated chapter summary:
Today we have a super special guest, and it's Amanda Cox. Great having you here. How are you? I'm well. Thanks for having me.
Moritz StefanerCould be. Anyways, I would say, as we don't have much to say, let's just bring our guest on. So today we have a super special guest, and it's Amanda Cox. Hi, Amanda.
Amanda CoxHi.
Enrico BertiniHi. Great having you here. How are you?
Amanda CoxI'm well. Thanks for having me. Yeah, you set a high bar with the introduction weather, and there's nothing going on.
Moritz StefanerYeah, that's like, what else could we now talk about? You know, it's all being said. Amanda, can you introduce yourself? I think most people know you, but still, it's. Yeah.
How a statistician landed at The New York Times AI generated chapter summary:
Amanda is a graphics editor at the New York Times. She has a background in statistics before the time. How did she end up at the Times? There aren't a lot of trained statisticians in newsrooms. The field has grown amazing.
Moritz StefanerYeah, that's like, what else could we now talk about? You know, it's all being said. Amanda, can you introduce yourself? I think most people know you, but still, it's. Yeah.
Amanda CoxI am a graphics editor at the New York Times, one of dozens. And, yeah, I have a background in statistics before the time. So the work that I do tends to be a little bit more statistically oriented than some of my colleagues who are, say, better artists or developers or whatever else.
Moritz StefanerAnd how did you end up at the New York Times? Did you study statistics and then directly go journalism, or did you do different things in between? What was the story there?
Amanda CoxYeah, I was in grad school, and I hated it. So the first year that you're in statistics grad school, they don't let you touch any data at all. It's all epsilons and greek letters. And so I didn't know what else I wanted to do with my life, but it wasn't epsilons and greek letters. And so I started applying for random things, in part just to see what I would be disappointed. Went by when they rejected me. And one of them was to be the Times intern on the graphics desk that summer. And so they were in a. Steve, Dwayne was in a think outside the box mode. Why not hire an intern who doesn't know any of the skills that we use in our work? And so I spent the summer as an intern, then I went back to grad school. My second year of grad school is better because they let you touch data. But near the end of that, I got a call. They said, hey, we have a slot. Do you know anyone? I said, I do. I know her very well. And so that was how I fell into the times, right through the internship program was my introduction to it.
Moritz StefanerAnd at the time, would you say you were the most, like, technical or statistical minded person on the team or.
Amanda CoxDefinitely not technical. Maybe proper statistics, you know, certainly. Probably the most formal definition of uncertainty or whatever, but definitely not technical because.
Moritz StefanerThere were a few hackers and coders around.
Amanda CoxYeah. Matt Erickson was doing who knows what, and Bill McNulty was doing fancy stuff with maps at the time. Sure. So. Yeah. Yeah.
Moritz StefanerBut it's true. I mean, you won't see so many trained statisticians in newsrooms, maybe in the nineties or something.
Amanda CoxRight.
Moritz StefanerBecause it seemed a bit remote from. Yeah. What a typical journalist profile might be. Right?
Amanda CoxYeah. I mean, there aren't a lot of, I think, trained statisticians in the nineties and the world, period. It's a field that has grown amazing. Like, you see the people who talk about their undergraduate program, enrollment in statistics, and those charts are all, you know, they're exponentially growing at a lot of the big schools, so. But, yeah, you're right that.
Moritz StefanerYeah, it used to be a really unsexy job, right? Like, yeah. And now it has the same.
Amanda CoxI don't know if it was ever unsexy, especially for, you know, I was always. I feel like actuary was always rated, like, you know, most charming career that you can, like, make a solid chunk of money and go home at 05:00 without any stress. Right? I don't know.
Making Charts in R AI generated chapter summary:
Amanda Keen: When did you start doing, creating charts and visualizations? Keen: My very first job after college was as an RA at the Federal Reserve Board. Keen: Most of that work in r still, not all now, but most of it. What's your experience with R and how it evolved over the years?
Enrico BertiniSo. And Amanda, when did you start doing, creating charts and visualizations? Right away, or there has been a development over time.
Amanda CoxYeah. So my very first job after college, I worked as an RA at the Federal Reserve Board. And their sort of, your price of admission is supposed to be to make the briefing charts for the economists. They have these meetings with the chairman, and they bring these books of charts. And if you're a good Ra, I think you're supposed to kind of hate making the charts. And then, but you do that 50% of the time. So your other half of the time, you get to spend on your research or whatever. Right? Like, I only liked making the charts. And so, and then, you know, when I was in grad school, I got myself into a point where I'd acquired like three or four different research assistantships. And instead of doing the work I was supposed to be doing the statistical work, I would just make what I would call, like the art of the week, which was just like a chart that fooled people into thinking I was actually working because I couldn't juggle all these properly at the same time. And so I think both of those, you can tell a coherent, backward looking story. In terms of making charts, though, I think it's cheating a little bit because I think there's probably 100 different things I could have ended up doing. And you could tell some kind of coherent story. If, you know, you cherry pick your.
Moritz StefanerHindsight, it almost makes sense.
Enrico BertiniAnd is it making charts something that you got or learned out of school or something that you learned afterwards or just on your own? Just curious to hear about, because I'm not aware of how degree in statistics work, but I know that statisticians have been developing a lot of interesting charts. And in general, there is a lot of research on visualization, on the statistics side. So I'm curious if you also get taught this kind of stuff in school or not.
Amanda CoxI had just one class. It was like an elective class. I was taught by a guy named Thomas Lumley, who's a brilliant statistician and a hilarious man. But it was a not very much work kind of elective. And so it was not the most rigorous part of my statistical education.
Enrico BertiniAnd so. And how long have you been at New York Times right now? I think it's been quite a while, right?
Amanda CoxTen years. This summer will be my anniversary. And I know that. I know that only because my first week, Hurricane Katrina happened, which was a gentle introduction to the grassroots.
Moritz StefanerQuite a start.
Amanda CoxSo, yeah, ten years this summer. A long time.
Moritz StefanerWow, that's cool. Yeah. And you use mostly r for at least for developing, exploring data and developing your basic visualization ideas? That right?
Amanda CoxI do still do most of my sketching. We've acquired this data, or we need to acquire this data from somewhere on the Internet. What does it look like? How might we approach it? I do most of that work in r still, not all now, but most of it.
Moritz StefanerAnd is it because the tool, is it like. Mostly it's one tool where it gets so fluent that it's just fun to work with? And could it also be with another tool, the same relationship? Or is it something about R and the packages it offers or the workflows that you think make it uniquely actually the best tool for the type of work you do? What's your experience with R and also how it evolved over the years? I guess you have followed the development quite a bit.
Amanda CoxYeah, I think for me a lot of it is just being fluent. I think there's nothing particularly special about R other than it just being the greatest software on earth, so. But there's, you know, I'm not actually using r for many of the things that R is great at. You know, we're not usually fitting very complicated models or doing that kind of work, but I think part of it is fluency and, but part of it is some of the choices made in r being written by people who don't really care about data structure or care about, want to allow users to, you know, do whatever they want with data and reshape it. And, you know, sort of that kind of things that come for free are useful. You know, the idea, like, I don't want to draw my own axies, I want you to draw them for me. Right. Like some of that kind of stuff, stuff that comes for free. Those decisions, I think, are uniquely delightful in our.
Moritz StefanerAnd how is it organized? So it's a little. So it has its own programming language, right, and its own idea of how data should be structured in order to work with it properly.
Amanda CoxSo it's, I think it largely doesn't care how data should be structured. I mean, so there are things like data frames, which you can think of as just an excel spreadsheet, you know, rows and columns functionally. And the data frame, I think, is languages that don't have a data frame. I don't understand how anyone does anything interesting in them, especially with structured type data, things that are rows and columns, you end up just hacking together your own version of a data frame. If it doesn't come with a data frame for free, sure.
Moritz StefanerAnd then there's lots of packages or different libraries. Is there a technical term?
Amanda CoxProbably thousands of them. And some people claim that's some of the success of our development. But I think it's really interesting now what's going on in that space. And there's a lot of people, me included, who have worked basically the same way and are for a very, very long time. And in the last six months, year and a half, I don't know, there have been some really great packages. So Hadley Wickham is sort of an r celebrity, but I never fallen into the Hadley verse before. But his link, do you mean Hadley Wickham? Hadley Wickham? Yeah, his latest. He has a package called dplyr, which you can, and some other people's works too, that there's this idea of chaining commands together. So first I sort, now I filter, then I mutate some columns in my data. And that stuff for the first time is really fun. And there's other people working on stuff, too. That makes r feel a little bit more natural with the Internet. Hadley is working on some of that, too, but that work space is exciting. So I think r is becoming a lot more of a grown up for the type of work that we do. It's changed a lot, I think, in the past year or so.
Enrico BertiniSo you've been using it from the very beginning, or there was something that you used before?
Amanda CoxNo, from the beginning.
Enrico BertiniOkay.
Moritz StefanerYeah. I mean, I always wanted to get started with r because everybody was like, oh, you have to use r, it's so cool. But I never got into it, really, because. Exactly. This whole ecosystem was so overwhelming. I think it's a bit like. So I do web development a lot, and I know that ecosystem quite well, but I could imagine if you get started with that, it's the same. It's like there's a billion libraries that do basically the same thing, but 90% of them are broken, and you have to figure out which. And it's super complicated. And then I felt you have to learn a lot by heart or like, yeah, just learn these commands from that package. And if you use a different package, they are different. It's like, it seems like a big hurdle there. Do you have any tips, like how to get started? If you're confused as I am.
R2.0: Starting with R AI generated chapter summary:
The only big limitation I see for doing visualization with R is that as you need it, it seems to be very, very limited. Do you have any tips, like how to get started?
Moritz StefanerYeah. I mean, I always wanted to get started with r because everybody was like, oh, you have to use r, it's so cool. But I never got into it, really, because. Exactly. This whole ecosystem was so overwhelming. I think it's a bit like. So I do web development a lot, and I know that ecosystem quite well, but I could imagine if you get started with that, it's the same. It's like there's a billion libraries that do basically the same thing, but 90% of them are broken, and you have to figure out which. And it's super complicated. And then I felt you have to learn a lot by heart or like, yeah, just learn these commands from that package. And if you use a different package, they are different. It's like, it seems like a big hurdle there. Do you have any tips, like how to get started? If you're confused as I am.
Amanda CoxThat's a good question. So many people are Hadley fans, and so you could just decide for our work, I am only going to reuse packages, written libraries, written by Hadley and that's. It would be one. And that would get you of the type of work that we do. That would get you 95% of the weighted done. And so stuff would work kind of consistently or just stick and stay roughly in base r. Like base r does 85% of the work that we do. So just no libraries at all. I only want points and lines and text, and I've been sort of that way for a while too. We're not doing stuff that's that complicated. We're merging data and filtering it, whatever. You can do that with a dozen commands. I scraped my. Not scraped, I processed all of the r code I had written at one point, and there were only 100 things I use all the time.
Moritz StefanerSo you were looking for which commands you used the most? Yeah, that's super interesting. Cool. Good idea. Meta analysis. Yeah, yes, yeah, yeah, yeah, yeah. That's a good tip, though. Like sticking with like one guy and then get into his mindset and then, you know, it makes sense. Hopefully that's a good idea. So I might give that a shot.
Amanda CoxI'll let you know how it's time marks you'll like.
Enrico BertiniBut I have to say that this is the kind of tip that I give to people who want to start doing visualization, that there are so many tools out there and you can get attracted by all of them, but it's a huge mess. Right. I think it's much, much better to stick with one and learn it very well. And especially another part of it is rather than choosing something that is new and is going to disappear tomorrow, I always say choose between the few, very few options that are out there that probably are going to be here in five or even ten years. Right. And there are not many that have these characteristics.
Moritz StefanerYeah, that's true.
Enrico BertiniYeah.
Moritz StefanerAnd now what do you output mostly with R? Like, would you like output mostly PDF's and then review the PDF's and annotate those? Or do you output like inductive little dashboards or what's your typical product you would produce with R?
Amanda CoxYeah, no, mine is mostly PDF's. And so if you're making a print graphic, that's great because you can just open it in illustrator and clean it up. And if you're making a web graphic, it's. PDF is a little disappointing. Now you have a place to start, so. But even I am a believer even for sketching in things that we call like, poor man's animation. Poor man's animation is just like holding the page down, key down on a PDF, your own flipbook.
Enrico BertiniYeah.
Moritz StefanerBut you can drop back and forth and. Yeah, I mean, PDF's do have some advantages, for sure.
Amanda CoxYeah. So, you know, if you want to make, like, you know, deeply interactive work, it's probably not the best way to start.
Enrico BertiniYeah. This is actually a question that I had for you because I think the only big limitation I see for doing visualization with r is that as soon as you need interaction, it seems to be very, very limited. Right.
Amanda CoxI mean, people have changed that, I think, a lot in the last year or so. Hadley works at a company called RSTudio, and they have this thing called shiny, which allows it's not the best interaction. And there's another guy blanking on his name who does this stuff that up. Two reals, D3 in a really graceful way if you're just making the same type of scatter plot over and over and over again. So some of that people are making, I feel like, real progress currently on connecting r to the web in a deeper way than it has been before.
Moritz StefanerAlso, Tableau can make calls to r subroutines, basically. So if you want to do complicated statistics in the background, you know, you can. Yeah. Write a routine that calculates something, gives the results back and these things. That's super interesting, I think. Yeah. Because many of the tools are so isolated and all have their strengths and. Yeah, that's kind of cool.
How to start learning R in 2017 AI generated chapter summary:
Amanda: Download the RSTudio and just do whatever. For learning, it's like, have a problem that you actually want to solve. You can scrape and plot and analyze and clean your data all in the same place. So, like, figure out the project that it is that you want to do and then become an r disciple.
Enrico BertiniSo is there Amanda, if some of our listeners want to start with her, do you have any suggestions where to start? Maybe if there is a nice tutorial you like or, I don't know, whatever. A book maybe?
Amanda CoxYeah, I think I just go to RSTudio way. So download the RSTudio and just do whatever. Play with it, whatever it is they tell you to do. I'm sure there's an intro there somewhere. I think. For learning, though, you'll never learn anything if you try to learn that way. So for learning, it's like, have a problem that you actually want to solve. And so our problems are problems where first I need to scrape some data off the Internet, and then I don't know what it looks like, but I want to try on different chart forms or just sort of explore the data I think are the best, sort of, like, at least in our space, the best learning with our kind of things. And so some of the advantages from that, like, you know, that you can scrape and plot and analyze and clean your data all in the same place. That's some of our strengths from my end. So, like, figure out the project that it is that you actually want to do that fits those, those categories and then become an r disciple.
What's the Workflow at The New York Times? AI generated chapter summary:
D3 is the obvious for all the interactive work. All of our print work and many of our web mock ups go through illustrator. We'd be lying if we didn't pretend that Excel was a workhorse in the department.
Enrico BertiniSo is there any other major tool that you guys use at New York Times other than r or r ching?
Amanda CoxYou know, D3 is the obvious for all the interactive work. And then other people, you know, you spoke to Gregor, I think, about some of the charting tools he's made for. Very simple charts where, you know, I just want to make a bar chart. So some internal tools for that. All of our print work and many of our web mock ups go through illustrator. We'd be lying if we didn't pretend that Excel was a workhorse in the department.
Enrico BertiniYeah, of course. So. And how does this work at New York Times? So can you describe a little bit what is the process, how a new piece starts and how do you get to the final graphics? I'm sure a lot of listeners are curious about what's the process there.
Inside The New York Times' Graphics Team AI generated chapter summary:
The process happens in a bunch of different ways. One graphics editor is attached, at least one is attached to all the major desks. Increasingly, we do our own work on breaking news. It's often small teams, two, three, four people whose skillsets overlap.
Enrico BertiniYeah, of course. So. And how does this work at New York Times? So can you describe a little bit what is the process, how a new piece starts and how do you get to the final graphics? I'm sure a lot of listeners are curious about what's the process there.
Amanda CoxSure. So I think the process happens in a bunch of different ways. One graphics editor is attached, at least one is attached to all the major desks. And so they go to their meetings, and increasingly, we do our own work. So here is an issue that I think is important to the world, or here is a news event that we need to respond to. We've always sort of done our own work on breaking news because there's not really time to coordinate. So we have a meeting in the morning around 1015, or Steve or Archie or someone else who leads to death says, here are the paper's priorities for the day. How are we going to respond to them? And then those of us who are not working on something daily go back to work that may run this weekend or two weeks from now or some time, unknown time in the future. And the work, it's often small teams, two, three, four people whose skillsets overlap, but usually someone's a stronger reporter, someone's maybe a stronger designer, someone's a stronger developer, and so very collaborative, kind of. But those teams are sort of organic in terms of just what an idea needs to get done.
Finding the Right Data for a Project AI generated chapter summary:
One of the main problems is also finding the right. The right data for the problem that you want to. My favorite data are where the data is not a struggle. The most interesting data is probably not the data that comes up after your first Google search.
Enrico BertiniAnd I guess, at least for some projects, one of the main problems is also finding the right. The right data for the problem that you want to. Right. I remember, I think I met Kevin Quilly a few weeks ago or days ago. No, weeks ago. And he was telling me this interesting story about a recent piece that you've done on upshot. I think it was on what people eat at. What was it about?
Amanda CoxChipotle.
Enrico BertiniChipotle. Which is hard. How do you get this data? And I guess there are instances where finding data is probably straightforward than other cases where you have to, I don't know, maybe even give up.
Amanda CoxYeah, I think my favorite ones are where the data is not a struggle. Like, I'm totally uninterested for the most part, in data that, like, you can just. It's the result of your first Google search or whatever those are. Those are not our, you know, sometimes if you have something clever to say about it or some. Some news peg to attach it to or something, but those. The acquisition or the acquiring of the data, the figuring out what their fallback plan is for that one in particular. Kevin and I were teaching a class at NYU at the time, and our fallback. Fallback. Fallback plan was that that week's homework assignment was going to be to make all of our 20 students go figure out each are responsible for 20 people's Chipotle orders or something. So, yeah, I think engaging with the world and the most interesting data is probably not the data that. That comes up after your first Google search.
The Art of Finding Where Films Come From AI generated chapter summary:
Here's a fun one about sort of just acquiring with technology. Sean Carter wanted to know where trailers come from in movies. Technology also becomes your journalistic tool, more or less, to actually find something out.
Enrico BertiniSo do you have any interesting story about data that has been particularly hard or, I don't know, fun to find?
Amanda CoxHard or fun, huh? Here's a fun one about sort of just acquiring with technology. A few years ago, I talked about this one a lot, but Sean Carter wanted to know where trailers come from in movies. So, you know, are the scenes in the trailer, do they come mostly from the beginning of the movie or the middle of the movie or the end of the movie? And so he had set up some code to, you know, essentially test it to reduce the images to, you know, their edges and then to just test, you know, does black and white from the trailer match the black and white stills from the movie? And one of them we were really struggling with was Argo. It was a best picture nominee. I think this was in 2013. And I assumed it was just because Sean's code was bad. Right. He'd never done edge matching before or image process, not a ton of image analysis. I just assumed he didn't know what he was doing or. The other problem was that the movies, we didn't have the movies from the most official sources in all cases, so who knows what was going on? But it turns out that the trailer for Argo, they're cheating in a few places. So, for example, there's a scene in the movie that shows the Hollywood sign in California, like, it looked like in the 1970s, the movie set in the 1970s, and it was falling down in disrepair. Like, half the letters were, like, on the ground but in the trailer, they just showed the Hollywood sign now, because all they want to do is read Hollywood. And then.
Moritz StefanerSo they used scenes that were not in the movie, actually.
Amanda CoxYeah, but very similar. So there was one scene of people looking at a tv screen, and in the movie it was a woman who was a leader in Iranian hostage crisis who was in the tv screen, but in the trailer, they just swapped that for Jimmy Carter's face. Just because it reads better. It reads as seventies or Hollywood, not. It's like, what is that? I don't understand. Whatever. So that was a fun one.
Moritz StefanerReally thought about the trailer, all the sweated, all the details. Yeah. Nice.
Amanda CoxI mean, it was a fun one in that we learned things through the acquisition of data that I don't think we would have learned if we could have just called up a publicist and said, like, tell me where the trailer comes from. Like, what scenes in the movie does it come from? But then, so, you know, Sean has acquired some of these image processing skills. And so then we applied them to a story about forgery in chinese art, you know, a couple, a year later or six months later or something. And so I thought it's both a fun example of acquisition and also the tools influencing the kind of. Or your skillset influencing the kind of work that you do.
Moritz StefanerYeah. And then technology also becomes your journalistic tool, more or less, to actually find something out, which is pretty cool.
Amanda CoxYeah.
Enrico BertiniAnd I think in the case of Chipotle, you ended up doing like scribbling something online.
Amanda CoxNo. One of our colleagues, Ellen McLean, had gotten a huge dataset from Grubhub for a different story, for just like an enterprise project several years earlier. So Alan had not worked at the Times for maybe three years by the time we ran the Chipotle graphic. But when we couldn't get data from an official source, we said, well, we have this huge data that Alan got just lying around that we never done anything with. And there were thousands of Chipotle orders in that dataset.
Tableau AI generated chapter summary:
This week, data stories is supported by Tableau software. Tableau lets people connect to any kind of data and visualize it on the fly. In the latest version, Tableau nine, you'll find features that make the product smarter. To find out more, get your free trial@Tableau. com.
Moritz StefanerThat's a good time to take a little break and talk about our sponsor this week. This week, data stories is supported by Tableau software, helping people see and understand their data. Tableau lets people connect to any kind of data and visualize it on the fly. Databases, spreadsheets, and even big data sources are easily combined into interactive visualizations, reports and dashboards. In the latest version, Tableau nine, you'll find features that make the product smarter about what you are doing from a new start experience with data prep tools to more analytics features and smart maps with geographic search, and across the entire analytical flow, they have invested really heavily in performance and new features to help you share your findings and collaborate with data. So what is your data trying to tell you? To find out, get your free trial@Tableau.com. Datastories. That's Tableau.com Datastories. And now back to the show.
Has Interactive Annotations Become Involveful? AI generated chapter summary:
Can annotation itself become interactive? I think probably the most consequential graphic of all time has interactive annotation. There's no space for the annotation on mobile. Mobile has killed annotation in many ways.
Moritz StefanerThat's a good time to take a little break and talk about our sponsor this week. This week, data stories is supported by Tableau software, helping people see and understand their data. Tableau lets people connect to any kind of data and visualize it on the fly. Databases, spreadsheets, and even big data sources are easily combined into interactive visualizations, reports and dashboards. In the latest version, Tableau nine, you'll find features that make the product smarter about what you are doing from a new start experience with data prep tools to more analytics features and smart maps with geographic search, and across the entire analytical flow, they have invested really heavily in performance and new features to help you share your findings and collaborate with data. So what is your data trying to tell you? To find out, get your free trial@Tableau.com. Datastories. That's Tableau.com Datastories. And now back to the show.
Enrico BertiniSo, shall we read some of the questions that our listeners posted? There are some interesting ones, maybe some of the stuff we covered already. Yeah, that's a good one. I think this is from Mushon. So Amanda Cox said, annotation is the most important thing we or they at New York Times graphics do. Can annotation itself become interactive?
Amanda CoxSo I think probably the most consequential graphic of all time has interactive annotation. Do you know which one I'm talking about? Do you have a guess?
Moritz StefanerInteractive annotation?
Enrico BertiniYeah, I think it depends what interactive annotation means. I'm not even sure.
Amanda CoxSo I want to claim that the GPS dot on your Google map, or whatever it is, is the ultimate important and actually changing the world, uh, kind of version of interactive annotation. Uh, and then if you don't buy that that is real annotation, then I can say, well, you know, Google will let me know in text on the map that you have a restaurant reservation here tonight or not, because I told Google, but just because it's in my email. And so I think for sure, uh, that is an annotation.
Moritz StefanerLike, you mean the blue dot, right? Like where you are, right?
Amanda CoxYeah, yeah, the, where you are. Yeah, I think.
Moritz StefanerSure.
Amanda CoxAnd I think it's super interactive. Right. You move and it updates. So I think that's the people are doing. So if you think that's cheating, I think there's lots of other examples, too. So Jake Barton and some of his museum work is doing this kind of cool stuff where they are trying to teach kids about physics. And so essentially how it works is the kids just do whatever they want, and then they draw stylized lines on top of the path of the ball they threw automatically.
Moritz StefanerYeah. So that's basically dynamic annotations maybe, right. It's like you don't just have, like, a static text pinned to a position in the graphic, but somehow you're reacting to what the user does or what the context is of an information query, and then you construct an annotation. Right. So you also recently had this article where the. The content, well, actually the text contents of the whole article changed depending on the user selection in a map right. Can you tell us a bit more about. I don't, I can't recall how much you involved in that project, but it was a New York Times project for sure.
Amanda CoxYeah. So, yeah. Kevin and Gregor and I and some others, Matthew Black and some of our colleagues in INt worked on a map about essentially what are good places and bad places for kids to grow up. So if you have to grow up poor, if you grow up in some places, you end up doing better on average than you would if you had grown up in a different place. And so I think we know, or we suspect that if we just show people a Us county map, the way you interpret it is, you see, like, oh, what does my county look like? And then learn about the broader world first. So we decided to sort of invert it on this one and say, like, let's not show you the map and force you to interact with it. Like, in some ways just get rid of interaction altogether and say, let's just guess about what you care about most based on where you're reading this from. So we'll geolocate where you're reading from and just show you a very zoomed in map for your area and then update the text in a relatively straightforward way. So I don't know if I consider that example annotation. But then after we did that one, we did another one off of the same research where we asked people to draw a chart for us and we talked about annotating that drawing. We talked about annotating that drawing as you were drawing it directly in the chart. But there's just to tell you, here's what you're doing. We could have, but the problem is there's no space for the annotation on mobile. Mobile has killed annotation in some ways. You just don't have room to say anything interesting. And so we killed that idea and just put the same text that we would have essentially put on your drawing inside of it, pointing to something just below, but also a little bit dynamic. I wouldn't call either of these super dynamic. There's baked out combinations.
Moritz StefanerYeah, but it was fairly sophisticated. I remember Gregor posted part of the logic sometime on Twitter and they were like, 1015 different categories of curves you try to identify. Like, is it more s shaped or is it consistently below or above?
Amanda CoxYeah, there were ten or 15 different types of curves. There were basically three types of lines and then some four types of lines and then some edge cases about whether you did weird stuff at the beginning or the end of your line. So it wasn't so crazy complicated.
Moritz StefanerYeah, I think it was fairly dynamic. Tell us more about the project. I think it's been a super surprising and extremely fun project, and not all of our listeners might know the basic idea behind it.
The Hidden Truth about How Much Money People Earn AI generated chapter summary:
Harvard researchers have created a chart that links income to chance of going to college. The site asks you to draw the curve. Then comments on that and shows you the real curve, but also shows you how others have drawn. It's a fun project that forces you to think through statistics.
Moritz StefanerYeah, I think it was fairly dynamic. Tell us more about the project. I think it's been a super surprising and extremely fun project, and not all of our listeners might know the basic idea behind it.
Amanda CoxOh, sure. So there's these researchers at Harvard on this one, and they have this amazing data set. I would give two of my left fingers for this data about how much people earn, and they've been able to link that to their parents earnings for basically everyone in the United States who's around my age. So who's around, you know, 30, 35. And so this data, the drawing example was, what's the chance that you go to college, depending on your parents income or your parents income rank, too. So, you know, if you were raised in the 25th percentile family, you have this percent of chance of going to college. And the chart is just, just amazing. And so part of the reason it's cool is because they have so much data. Lots of people have done this exercise for quartiles or quintiles, but that's, you know, you get four data points, five data points. It's not, it's not that sexy, but so they have, you know, they did it for every percentile. So they have 100 data points, and they form, essentially a perfectly straight line, which I find just mind blowing. The idea that, you know, it's, there's some, you know, you know, of course it's averages, and there's a lot of variation in between. But on average, the power of classes is so straight, right? Like, it doesn't top out. The difference between being raised in the 10th percentile and the 20th percentile is the same as the difference between being raised in the 80th and the 90th, which I find just like a crazy idea. But then the question is, like, how do we make this chart that's a straight line? And some people, like, be excited about this, have your mind be blown about, because it's just a stupid, you know, it's just, it's perfectly straight.
Moritz StefanerIt's not that interesting, and it's actually exciting statistically. But it doesn't look exciting, right?
Amanda CoxYes, exactly right. Like, it's like, it's like, the fact that it is linear is amazing to me. You know, linear throughout the whole income spectrum, you know, not basically linear in the middle, but just linear, linear, linear, linear. I think it was actually, you know, Kevin's idea. He was like, well, what if we just ask, like, famous people, you know, smart, famous economists to do it for us? He had a couple in particular he was thinking about. But then that just evolved into, like, what if we just ask everyone to do it? So.
Moritz StefanerYeah, because if you had asked celebrities or, like, you know, famous people, it's sort of shaming, then, you know, like, oh, they are so wrong. Right.
Amanda CoxSure.
Moritz StefanerAnd if you try it yourself, I think, what's so smart about it? So the site asks you to draw the curve. What do you think? Like, how is the curve looking? And then comments on that and shows you the real curve, but also shows you how others have drawn. So it's not just presenting you with facts, but first of all, asks a question that really forces you to think through, like, what is my model like? You know, can I develop a model in my head now of how I think the world works in this little, like, question and then also compare that, like, to how other people think and so on. And I think that this type of knowledge sticks so well, right, where you actually had to, where you were, like, hypothesizing yourself, like, how could it be? Is it like an s curve or is it like exponential? You know, and suddenly you learn so much about statistics because you have to apply it yourself. Right?
Amanda CoxYeah. There's some cognitive psychology. I tried for like a half hour before we published this project to find it and to say, you know what? People who do this professionally actually think about this sort of stunt. And I didn't come up with anything great. But after we published, there were some education professors and others who, talking amongst themselves on Twitter, were passing around some interesting papers about how it's, in a lot of teaching, it's better to actually get it wrong first, struggle with it and then fix it at the end.
Moritz StefanerAnd often people have this idea that then you have the wrong conceptual idea in mind. But I absolutely agree. In my experience, if you ever actually struggling with something and then finally you succeed or, you know, or finally you figure it out that it's there and it doesn't go away anymore. Yeah. Jake Barton, I also saw, he had this really nice slide. I just saw the photo of the slide. But it was also about this phenomenon that if you tell people the answer straight away, like, if you just present facts, you know, then people forget the question. And actually the question is the whole, the exciting part. Right? It's like, yeah, if you just answer this question straight away, the question itself becomes totally irrelevant. Yeah. It's solved.
Enrico BertiniWhich has quite a lot of implications on the way we teach in general, I guess.
Moritz StefanerSo many. How many responses did you get? It was quite a lot. Right?
Amanda CoxYeah. So I think I forget how many. So we have. We're saying, I think, 100,000 people, roughly on the version of the graphic that we publish right now. But that's only people who were logged in and registered. We did something a little bit different with people who were anonymous. So logged in and registered plus anonymous is a lot bigger than that. So hundreds of. Wow.
Moritz StefanerWow. So actually, you could now do a whole social study on how people think reality is.
Amanda CoxWe did, Kevin did do a follow up post the next about, you know, what people.
Moritz StefanerThat's for sure, what people thought.
Enrico BertiniSo shall we read another question?
Has the Upshot New York Times changed your workflow? AI generated chapter summary:
Scott Murray: How has launch of upshot New York Times changed your workflow? Murray: Most data journalism is successful when you don't need the adjective. How do you see this developing? There's also, of course, 538, Nate Silver's maybe similar data journalistic publication.
Moritz StefanerYes.
Enrico BertiniOkay. We have some few ones from Scott Murray. So let's start from the first one. How has launch of upshot New York Times changed your workflow? And I think there is another one that is similar to that. Probably we can group them together.
Amanda CoxYeah, sure. So the upshot is sort of like a section at the New York Times. I guess in some ways, it has a handful of reporters attached to it, people who are experts in the economy or politics or healthcare or gender and technology. And so we're, you know, a relatively small section within the times. And so I've been attached to it for, for about a year and a half now. One of the ways where I say it's changed things for me, I'll tell you, tell you an anecdote, is that I like to say, like, we have a little bit more freedom to what I sometimes call, like, making things up. And David Leonhardt, who's the editor of the upshot, he hates when I say that. He calls it analytical judgments. And so you can tell why, why one of the two of us used to be the Washington bureau chief of the New York Times, and one of us is, one of us is a graphic sensor. But I think we have a little bit more freedom in making some of these analytical judgments. So the example, or one of the examples that I'm proud of is on election night, you know, we're sort of doing a little bit of live analysis of the results as they came in to say, you know, like, given what we know so far, here's how we think the night might end up. Like that kind of an exercise is one of the ways, I think the upshot is maybe a little bit different. The other way, obviously, Gregor made us, made the times, this tool called Mister chartmaker, that enables reporters, and the upshot reporters are the ones who get access to this tool to make their own charts, to essentially say, if you want to put a bar chart in your story, that's great. But I don't want to spend my afternoon on it. And so that has changed, I think, our interactions in some way. So both in good and bad ways. Right. Like the good way, obviously, they do not interrupt me when they want a bar chart in their story. And the bad way, I think we sometimes miss some opportunities that graphic senators and other sections would have picked up on about. Like here you have a stub of an idea, and that would result in a totally different idea that I can make something really cool out of. But that is one way the upshot works differently than the rest of the times. Right.
Moritz StefanerNow, how do you see this developing? There's also, of course, 538, Nate Silver's maybe similar data journalistic publication. And then there's the upshot. These are, I guess, the two big data journalistic platforms at the moment. Right. And how do you think, will this develop? Will they stay the same in character as they are now? Does that affect also maybe, like, do you do now all the data heavy stuff and the New York Times becomes a bit more lighter in data? Or how do you think this relationship will develop? Is it not at all, but just different presentation, or how do you see the relationship?
Amanda CoxYeah, I don't think upshot graphics are aggressively different from, or even different at all from the rest of the New York Times graphics. I also think that most data journalism is sort of a, how do I say this in a polite way? I think data journalism is successful when you don't need the adjective. So, you know, I've said before that, like, the more adjectives you have, like, the less power you have, right? Like, you never want to be like the assistant deputy vice president or whatever. Or you know that, like, you know that gay marriage is real when it's just marriage, right? When you don't need the adjective in front of it. So I think data journalism is only successful when it's, when it's indistinguishable from journalism journalism.
Moritz StefanerBut still, I mean, upshot and 538, they have a data flavor to them. And I think that's, that's the idea, right? It's like, these are data ish outlets, right?
Amanda CoxYeah. I think that is, you know, Nate Silver's conception of journalism is that, you know, the journalism that he is good at is very data focused. I think that upshot has many reporters who are on the spectrum of that attitude, though obviously believe that, you know, I'm basically indifferent to the mechanism behind your story, right? Like, I always think, for example, like, you know, whenever a New York Times story has to tell you how many interviews it conducted. Like, to me, that's an inner signal to stop reading, right. If you have to say, like, the time spent six months talking to 85 people, that's just saying, like, and we didn't find anything. So we just have to impress you by, like, how much Harvey tried, tried.
Moritz StefanerReally hard, but still didn't work.
Amanda CoxYes. Right. Because if you found something cool, you would just tell me the cool thing. You wouldn't tell me that, like, we had to talk to 85 people, you know, like, whatever. And so I feel the same way about, like, data. Like, I don't care really, like, how you get to truth if it comes from quotes or if it comes from a spreadsheet or whatever. Like, I find that to be not the really interesting part. Like, the how. The how part I think is, like, not, I don't care that much about it.
Moritz StefanerBut that means you trust your journalists quite well, right? Like, if you say, well, the exact method, I don't even want to go there, or I don't even want to know how large the sample size was because I trust these people.
Amanda CoxNo, I think you still have to, you know, you're still showing it's a different type of evidence. Right. Like, I'm saying that I've not, by default, I don't value some types of evidence more than other types of evidence. Right. Like, you still need to lay out your case about why, you know, why this is a proper view of the world and why these people know what they're talking about and how this data was collected. Methodological issues, you know, you still are doing, you know, you're arguing with evidence. But I don't think there are certain types of evidence that on their face are, you know, blanketly better, you know, for certain types of problems, sure.
Moritz StefanerBut no, sure, you could have like a million data points, be horribly wrong. Like, and even, you know, you do the right thing. Technically, yeah, but there's like, this huge bias in the data set or you totally missed one obvious, like, yeah, just flaw, and then the whole thing breaks down. You can have millions of data points.
Amanda CoxOr it doesn't really mean what you think it means. So yesterday, behind me, Jeremy Ashkenas was looking at hate crimes reported in the United States. And different jurisdictions do it differently. So apparently there was only one in the state of Mississippi for a whole year. So, you know, like, it's not, the data isn't actually about hate crimes. It's about the reporting of hate crimes. So, you know, like, all of those.
The bias of hate crimes AI generated chapter summary:
The interpretation of data is super important. Have you ever stopped the factually right stories because you had the feeling people might get it wrong? There are huge biases in how things are reported.
Amanda CoxOr it doesn't really mean what you think it means. So yesterday, behind me, Jeremy Ashkenas was looking at hate crimes reported in the United States. And different jurisdictions do it differently. So apparently there was only one in the state of Mississippi for a whole year. So, you know, like, it's not, the data isn't actually about hate crimes. It's about the reporting of hate crimes. So, you know, like, all of those.
Moritz StefanerKind of issues how do you deal with that? Like, how much do you think about what will people think when they read that? As opposed to you knowing something is factually right? Like, have you ever stopped the factually right stories because you had the feeling people might get it wrong?
Amanda CoxSure. Or I'm trying to think of a good example, but I'm sure there are. Yeah, sure. I think in odd ways about, like, that's not how I actually interpret this or. Hmm. I'm trying to think of a good example. I don't have one off the top of my head, but, yeah, I mean, the interpretation of it is super important.
Moritz StefanerYeah, yeah, yeah. I just had to think of, like, the guardian had like a actually quite nice interactive application where they would show all the. Well, nice. In this context, not a good word, but it was about police killings and, like, who, you know, who was killed by police. And technically they had all the statistics. Right. But for instance, they just reported the percentages of victims and then the race or the ethnicity. Yeah, but it would probably have made more sense to relate that to the share of population as well. Like, you know, so. Because in their data, it sounded like there's not many Hispanics and many blacks affected, but if you would relate it to the actual share in that population of that ethnicity, it's much higher. So, yeah, these are all these things, like, if you just report the numbers and you're not super careful about how they will be read, then you're in trouble.
Enrico BertiniYeah. And even then it wouldn't be perfect because the process behind data collection for these kind of data sets is very complicated.
Moritz StefanerRight, right.
Enrico BertiniThere are huge biases in how things are reported. Yeah. Should we continue with the questions?
Moritz StefanerMore questions. Yes.
How to teach data visualization AI generated chapter summary:
Kevin Quailey and I teach a class at NYU in the journalism program. He says it's inversely how much you prepare on paper to how good it is in the classroom. If you start with interesting questions, students learn so much faster, he says.
Enrico BertiniSo, yes, two more. One is, I don't know exactly what he means, but he just says, thoughts on teaching? I don't know if it means. Yeah. How to teach visualization or something like that.
Amanda CoxI think you may know. A couple weeks ago was at IO, and we walked back at night, like the nighttime venue was like a half hour away from the hotel. And we walked back together one night, and I was telling him the story about. So Kevin and I. Kevin Quailey and I teach a class at NYU in the journalism program. Sometimes science, sometimes studio 20, sometimes other things about data, largely. I wouldn't call it data visualization necessarily, but I was telling him that I think the days that we go in, like, totally unprepared, not totally unprepared, we go in like a surgeon. Right? Like, we call it, like, going in like, you know, like, I've prepared my whole life for this, so I just show up to do what I tried to do. Those days, I think I have a tendency to be like, way better classes, like, both more fun, both more pedagogically engaging. Both people actually learn things more than days where I spend like 4 hours preparing some guided bullet point tutorial, classically super prepared. So I was telling him that I find there's this interesting thing. It's inversely how much you prepare on paper to how good it is, which is maybe he actually wanted a straight answer about teaching, but that's what I think of. That's what I think of in the last week when I think of.
Enrico BertiniNo, but I have to say this totally resonates with my experience as well. I mean, as soon as I just try to be the professor and just spreading my wisdom through the classroom, it just doesn't work. Right. But if you manage to start with a couple of interesting questions and you make it clear that not necessarily you already know the answer, it's just an exercise that we are doing together, then everything gets so much more interesting. I have to say, for instance, in my, I've been teaching my visualization course three or four times already. And the part where I see my students learning a lot is really like when they show me what they've done, I sit next to them and I say, hey, look, you should have. Maybe you should try this and that. And they learn so much faster this way. Whereas if I present slides for ten days and then I ask a question, I'm always disappointed by the answer. Right. So I think that's a very interesting thing that happens when you teach visualization. And probably it's not only with visualization that this happens.
Amanda CoxYeah.
How To Tolerate Kevin Quilly AI generated chapter summary:
And the next question was, how to tolerate Kevin Quilly. Kevin and I say that all of the bad parts of ourselves are the same. I think our memoirs now is just titled disagreements.
Enrico BertiniAnd the next question was, how to tolerate Kevin Quilly.
Amanda CoxIt's a great question.
Enrico BertiniPoor Kevin. We should have him on the show sometime.
Amanda CoxKevin and I say that all of the bad parts of ourselves are the same. And then. So there, there are also, you know, Kevin has many good parts that are unique for me, but all of the bad parts of us are exactly the same. And so I tolerate him. But we also. So we used to joke for a long time when we disagreed about something that there would be a chapter in our memoirs called disagreements. Right. But lately, lately that joke has evolved into, like. So it's no longer just a chapter in our memoirs. I think our memoirs now is just titled disagreements. And so it's like, you know, chapter, we'll stick this in chapter four of, you know, not or chapter seven or, you know, so it's no. So Kevin and I were both attached to the upshot. We sit, you know, two and a half feet from each other. We teach together. My. So Kevin is. Kevin is a good friend of mine, so.
Enrico BertiniBut on a little, little more serious note. So you've been talking about disagreeing on something. I'm just wondering now. I guess it happens sometimes, maybe even often, that during the process of creating a new piece, you disagree. So not just you and Kevin, I think in general. So what kind of disagreements you guys have? Is it more on the story itself, on the interpretation, on the way to visualize something? I'm just curious about that.
In the Elevator With Kevin Flanery AI generated chapter summary:
Kevin: I hate video. Kevin, I like complicated things. Kevin likes tables. Complicated tables are the worst kind of tables. Is it more on the story itself, on the interpretation or on the way to visualize something?
Enrico BertiniBut on a little, little more serious note. So you've been talking about disagreeing on something. I'm just wondering now. I guess it happens sometimes, maybe even often, that during the process of creating a new piece, you disagree. So not just you and Kevin, I think in general. So what kind of disagreements you guys have? Is it more on the story itself, on the interpretation, on the way to visualize something? I'm just curious about that.
Amanda CoxYeah. So Kevin likes video. I hate video. Kevin, I like complicated things. Kevin likes tables. What else? What are the other classic ones?
Moritz StefanerYou can make a complicated table?
Amanda CoxComplicated tables are the worst kind of tables, right? If you're gonna make a table, just make a table. That's like the worst position to be in.
The History of the Histogram AI generated chapter summary:
The histogram is an interesting example of how defaults matter a ton in software. Part of the reason they die is because it's not easy to make a histogram in excel. I think the histograms are a fun example of ways defaults in software change people's approach to the world.
Enrico BertiniDo you guys have any favorite chart? Chart type?
Amanda CoxI mean, we're calling 2015 the year of the histogram. I think the histogram is an interesting example of how defaults matter a ton in software. So, you know, histograms, they teach them in kindergarten. Like, no joke. That's one of the first charting forms you ever experience. You know, how many of us are six? How many of us are seven, whatever, right? And then they die at some point. And I think part of the reason they die is because it's not easy to make a histogram in excel. And so I think the histograms are a fun example of ways defaults in software change people's approach to the world, even when they're not necessarily aware of it or not necessarily biased against histograms. Just biased because it's hard. In the same way that, like, for a long time I was, you know, biased against bezier curves because they weren't trivial in r for, you know, kind of like how, how defaults influence what you think about.
Enrico BertiniYeah. And I'm curious to hear from you, what's your take on complicated charts or more than complicated type of charts that may be effective, but people don't know how to interpret right away? I think there are a quite solid set of charts that you can expect people to understand, at least to some extent, and some other chart types that people have probably never seen. Right. So how do you handle that?
More Complex Charts AI generated chapter summary:
I'm a fan of maximizing what I call net joy. What's your take on complicated charts that may be effective, but people don't know how to interpret right away? I think there are a quite solid set of charts that you can expect people to understand.
Enrico BertiniYeah. And I'm curious to hear from you, what's your take on complicated charts or more than complicated type of charts that may be effective, but people don't know how to interpret right away? I think there are a quite solid set of charts that you can expect people to understand, at least to some extent, and some other chart types that people have probably never seen. Right. So how do you handle that?
Amanda CoxYou probably know I say it a lot, but I'm a fan of maximizing what I call net joy. So if you cause, like, one person, infinite joy or understanding or whatever with your chart, and, like, the rest of the world is numb to it. It's possible that that is better than, you know, just doing something that, like, almost all of the world is, like, slightly above numb to. And so I, in the maximization problem that is these decisions, I'm a fan of arguing for net, and whereas a lot of people, I think, are fans of arguing for minimum, and that difference between net and minimum, I think, changes a lot of how you think about things.
Moritz StefanerYeah. You can easily show that in a histogram, by the way.
Amanda CoxYes, very much so. Yeah.
Amanda on Interest in Data AI generated chapter summary:
Amanda: Data that is really great data is not on the Internet. That's the proprietary data that people actually make decisions off of. A lot of my best work probably really is commodity data. Are you interested in topics that have been explored already?
Moritz StefanerThat's great. Shall we come to Lynn's questions?
Enrico BertiniLet's go there.
Moritz StefanerYeah. The first one was, Amanda, do you have any burning topics of interest that you would like to work on but you cannot get the data for it?
Amanda CoxSure. So, I mean, I think I said this earlier, that data that is really great data is not on the Internet. And so I think I'm interested in all kinds of data. That's the proprietary data that people actually make decisions off of, whether that's your Walmart, target, Google, driverless cars, slash medical policy, whatever, all of that kind of stuff. Like, you know, any data that people actually make actionable decisions off of is stuff that I think I'm kind of jealous of.
Moritz StefanerBut let's say you were working then at Uber, or, you know, some other company that has, like, super deep, interesting data. Wouldn't you then be bored? Because then it's also, like, readily available and just too easy to get.
Amanda CoxMaybe. Or, you know, after six months, maybe. Right? Like, you run out of. You're like, oh, but then it stops being like, Uber. It's like you're like, oh, my Uber grader is great, but it'd be really great if I can merge it with proprietary data from whatever other place. So it's always like, you're always attracted to what you can't have, right?
Moritz StefanerIt's like the grass, the data sets you can easily find on the Internet. Are you not interested because you would assume they have been explored already sufficiently, or the low hanging fruit is maybe already done? Or is it more?
Amanda CoxI mean, that's not entirely part of.
Moritz StefanerThe whole experience for you.
Amanda CoxI'm exaggerating when I say that, but some of all of those things, a lot of my best work probably really is commodity data. So I don't know what data stories episode. It was the video one where you talked about Gregor, about 3d yield curves. That data is all just sitting on the Internet. I still think that's a cool chart. And so, you know, I'm being unfair when I say data on the Internet isn't cool, but yeah.
Moritz StefanerYeah. And Lynn had a second tweet. Another big question. What are the current hard problems you see in viz, beyond visualizing uncertainty, which is sort of the default answer to that?
What are the current hard problems you see in viz AI generated chapter summary:
By default, the really easy stuff for us is the questions about what and where and when. But the why and the how questions, I think, are always more interesting and important. Maybe you need to think more in terms of illustration and explanation and storytelling.
Moritz StefanerYeah. And Lynn had a second tweet. Another big question. What are the current hard problems you see in viz, beyond visualizing uncertainty, which is sort of the default answer to that?
Amanda CoxYeah. I mean, I think in part forcing ourselves to work on problems that actually matter is not the easiest thing. And I was trying to think about why that is any different for visualization than for anyone else in the world. Because they're hard. Because whatever. But I think there is something interesting about the visualization and that by default, the really easy stuff for us is the questions about what and where and when. Visualization of different types is really good at answering those kind of questions. But the why and the how questions, I think, are always, like, way more interesting and important. But in visualization, those kind of devolve into unfun things because they're either, you know, they're estimates then, so you don't resonate with what the units are or whatever. And so I think there's some. But the, the why and the how questions are always way better questions, you know, questions that have consequences and questions that matter to the world. And so I don't know, there's some like eyes on the prize thing. I think that is difficult about visualization because, you know, the, ends up that the, you know, the best answer for guiding policy or whatever is really the best form for that is a, it's a bar chart or something, you know, with some uncertainty bands on it. But that's not necessarily the, you know, a two line bar chart is not like, doesn't, you know. Yeah, this is, by the way.
Enrico BertiniOh, sorry. Sorry.
Amanda CoxNo, go ahead.
Enrico BertiniI was just saying that this is, by the way, where people fight the most. Right. On why and how things happen. Right.
Amanda CoxYeah. I mean, because it's, you know, there's, there's interpretation problems and there's estimation problems. But those are really the interesting questions, I think.
Moritz StefanerYeah. And I mean, in many ways, maybe you need to think more in terms of illustration and explanation and storytelling and, you know, these types of things. If you. Yeah, the more you go into these, these how and why.
Amanda CoxSure. But then it's just this mushy thing, like this mushy universe that you invented in your head and there's like arrows flowing from who knows where to who knows where that, like, you know, has no, like, grounding in anything other than.
Moritz StefanerExactly. And this is maybe why we dodged this a bit and just concentrate on the what? Because there we have. It's more like the home turf.
Amanda CoxYeah, it is what it is. Yeah.
What Does It Mean for Work to Be relevant? AI generated chapter summary:
How can we make our work actually relevant? Yeah, I think it's a question people struggle with. How can we work on problems that, you know, actually change, change thinking or change behavior or help us make better decisions?
Moritz StefanerSo you mean the hard problems, actually, to be relevant. Could you actually say that, like, struggling with this? Okay, we can plot a lot of data. We can actually, like, analyze lots of data, but how. How can we make our work actually. Actually relevant?
Amanda CoxYeah, I think so. You know, I think it's a question people struggle with. And, you know, people are always pointing to, like, Jon Snow or something. He, you know, he didn't fix cholera with his map. That was marketing material. You know, totally. Like, he knew the answer and then he needed some marketing material. And so the, you know, the question is how can we. How can we work on problems that, you know, actually change, change thinking or change behavior or help us make better decisions or that, you know, eyes on the prize, kind of like, how do we work on stuff that's important, issues that are actually important as opposed to gravitate towards data that's readily available on the Internet or easy to, like, Fitzroy forums. We're interested. Well, whatever.
Has Visualization Had an Impact on People's Lives? AI generated chapter summary:
The question of impact is always a tricky one because I am pretty sure that there are lots of things happening out there that we are just not aware of. In a lot of organizations and even among scientists, there are things happening that are at least partially done through visualization that have a strong impact.
Enrico BertiniSo do you know, do you have any visualization in mind that can either come from New York Times or other sources that you think had some degree of impact on people's lives?
Amanda CoxI'm going to go back to my, you know, the Google map. Right. Like, and the driverless cars that are going to result as exhaust from the Google map. I think it's going to save hundreds of thousands of people's lives.
Enrico BertiniYeah.
Amanda CoxSo it's not the actual visualization. It's not the looking at the Google map that doesn't.
Enrico BertiniYeah. The question of impact is always a tricky one because I am pretty sure that there are lots of things happening out there that we are just not aware of. Right. So, I mean, in a lot of organizations and even among scientists, there are lots of things happening that are at least partially done through visualization that have a strong impact, but we just don't know it.
Amanda CoxRight, sure. And I think that probably the visualization that they're doing, not in all cases, but in many cases is not going to be. Is going to be the type of, like, you know, four lines. Four lines on a chart or that, you know, the most important sort of work is not always the most interesting visualization, I think.
Enrico BertiniOh, yeah, yeah, yeah, yeah. No, I have to say, I had a revealing moment some time ago. I think one year ago or so, I was giving a presentation here at NYU, and one of my colleagues from, I think he's a computer scientist, biologist. I was arguing that we don't know whether visualization is having an impact in some domains and so on, especially in science. Right. He was raising his hand and saying, wait a minute, I can give you so many examples in biology where visualization was part of the process and had a huge impact on people's discoveries and communication of scientific knowledge. So it's definitely not true. It's more like that. We, we just don't see these things.
Amanda CoxYeah.
What are your favorite design tools? AI generated chapter summary:
The tools market is really evolving. I'm going to stick with r across the spectrum because it's time for more. Some of the most interesting tool work is going on right now. Maybe we should have a show on all the new web based tools popping up.
Enrico BertiniI think we have one last question. Right. So we have one from Petra Eisenberg. I'm not totally sure I understand the question completely. So Petra asks favorite design tools in the spectrum of data stories that offer free interpretation to unambiguous messages.
Amanda CoxSo I'm choosing to interpret that. And we are sorry, Petra, if we're wrong about what are the favorite design tools, ranging from kind of more exploratory graphics to more explanatory graphics. And I'm going to stick with r across the spectrum because it's time for more. It's to learn. I can't think of anything better at any point other than the very, very, very final end. I also, I think, you know, in the times there was a part where the explorer graphic, there was a phase three or four years ago where the explorer graphic was kind of popular. It was like we can put some filters and some sliders and some something. And that phase, I think we've grown out of that phase. We've grown up a little bit or phones have caused us to just disregard it a little bit. So I think we're, it's possible that I'm going to miss the exploratory tools just because we don't do very much of that work anymore for a variety of reasons.
Moritz StefanerYeah. And also the tools market is really evolving. But as we said before, everybody has their tools that they are fluent with and that's such a huge value. If you have an environment that you're happy with and you can just work is such a big value. So I also feel like, yeah, we might be missing lots of great tools, so maybe we should have one day a show on all the new web based tools popping up and maybe we discover something new.
Amanda CoxWho knows though, whenever, you know, academics in particular ask me what should they learn, what should they, I always say, like whatever it is that you're always already using. So I really hate context switching. So like people is a Stata, people make your graphs in Stata. And if you are people are our people, do it in r. And if your people are web development people just use your sketch in D3 or what? You know, like whatever. I feel like wherever you already are is the best tool.
Moritz StefanerSo I won't learn r?
Amanda CoxNo, I don't know. I think in some ways some of the most interesting tool work that's going on right now, and this is also something that I should have said that different about the upshot earlier is I think we're integrating words not in a fancy design way, but just like, here's a paragraph, here's a chart, here's a paragraph, here's a chart. And so like the stuff that came out a couple weeks ago, I think, from, I'm going to say it wrong, Bestiario or whatever, like that quadrogram maybe it's called that idea that like we're really just using expository tools. And so you should be able to write your paragraph and dump your charts inside of it and then write your next paragraph and make a chart. And it shouldn't be like a bunch of switching or, you know, it shouldn't feel aggressively different. I think those kind of tools are interesting. You know, I know someone in, in Jeff Heer's group at the University of Washington who's working on more kind of Google Doc style creation of charts. So some of the collaborative stuff, like, not sort of like GitHub tag team kind of merge conflict, that will fix it, but real more Google Docs, the barriers over the lower end, it feels a little bit more collaborative. That stuff, I think is also interesting in the tools space.
Moritz StefanerYeah, that's true. And Quadrigram.com, I found that really interesting too, because it offers simple charts, but it has this idea of, there's a document and the document consists of a mixture or sequence of simple blocks. Right. This is something I think we all now learn to appreciate with all the mobile devices and, you know, and all these different consumption situations. And we used to do this huge, monolithic, super detailed blocks. Yeah. That are super integrated, which is nice too, like, you know, double spread print page or something. But now we have to think more about how we can, how can we chop it up into sequences of little blocks. And quadrogram has a really interesting model there. So. Yeah, we should talk about that one maybe as well. Do you know how the tool from Jeff Heer's group is called? Is that something that's available or is it currently being.
Amanda CoxI think it's someone's research project. I think it's someone's dissertation.
Moritz StefanerSo it will come out at some point. Maybe.
Amanda CoxYeah, maybe.
Moritz StefanerYeah. Sounds good too. Great. I think we have to wrap it up. We are good. Yeah. Over an hour was great talking to you.
Amanda CoxCut something.
Moritz StefanerWe'll leave it all in there. Yeah, it was all good. Cool. Thanks so much.
Amanda CoxThanks for having me. Thank you.
Moritz StefanerYeah, thanks.
Enrico BertiniThanks, Amanda.
Moritz StefanerBye bye.
Enrico BertiniBye.
Amanda CoxBye bye.
Enrico BertiniData stories is supported by Tableau software, helping people see and understand their data. Get answers from interactive dashboards. Wherever you go for your free trial, visit Tableau software@Tableau.com. Datastories this is Tableau.com Datastories.