DiscoverDear AnalystDear Analyst #125: How to identify Taylor Swift’s most underrated songs using data with Andrew Firriolo
Dear Analyst #125: How to identify Taylor Swift’s most underrated songs using data with Andrew Firriolo

Dear Analyst #125: How to identify Taylor Swift’s most underrated songs using data with Andrew Firriolo

Update: 2024-03-25
Share

Description

Sometimes pop culture and data analysis meet and the result is something interesting, thought-provoking, and of course controversial. How can one use data to prove definitely which Taylor Swift songs are the most underrated? Isn’t this a question for your heart to answer? Andrew Firriolo sought to answer this question over the last few months and the results are interesting (if you’re a Taylor Swift fan). As a Swiftie since 2006 (moniker for Taylor Swift fans), Andrew wanted to find a way to bridge his passions for Taylor Swift and data analysis. He’s currently a senior data analyst at Buzzfeed, and published his findings on Buzzfeed to much reaction from the Swiftie community. In the words of Taylor Swift, Andrew’s methodology and analysis just “hits different.”





<figure class="wp-block-image size-full"></figure>



From comp sci to data analytics





Andrew studied computer science at New Jersey Institute of Technology but realized he liked the math parts of his degree over the engineering parts. Like many guests on this podcast, he made a transition to data analytics. Interestingly, it wasn’t a job that propelled him into the world of data analytics. But rather, going to graduate school at Georgia Institute of Technology (Georgia Tech). GIT has some really affordable online technical programs including data analytics. After getting his master’s degree, he worked at Rolling Stone as a data analyst. This is the beginning of Andrew’s exploration into the Spotify API to see the data behind music. You can see some of the articles Andrew published while at Rolling Stone here.





<figure class="wp-block-image size-full"><figcaption class="wp-element-caption">Source: Pocketmags</figcaption></figure>



After Rolling Stone, Andrew landed his current role at Buzzfeed building internal dashboards and doing internal analysis. In both of his roles, he talks about using a lot of SQL and R. A big part of his job is explaining the analyses he’s doing to his colleagues. This is where the data storytelling aspect of a data analyst’s job comes into play. I call this the “soft” side of analytics but some would argue that it’s the most important part of a data analyst’s job. In most data analyst roles you aren’t just sitting at your desk writing SQL queries and building Excel models. You’re a business partner with other people in the organization communication skills are more important than technical skills.





Answering a Taylor Swift question with data





Andrew became a Taylor Swift fan through his sister in 2006. They both listed to the world premier of Taylor’s first album. Given his background in data, Andrew decided to answer a question about Taylor Swift that’s been on his mind for a while: what are Taylor Swift’s most underrated songs?





<figure class="wp-block-image size-full"></figure>



To read Andrew’s full article, go to this Buzzfeed post.





Andrew’s hypothesis was that there’s a way to use data to prove which songs in Taylor’s discography are most underrated. When I classify something as “underrated,” it’s usually a decision you make with your gut. But it’s always interesting to see the data (and the methodology) for determining if something is truly “underrated.”





Multiple iterations in song streaming analysis





As mentioned earlier, Andrew made good use of Spotify’s API. The API gives you a plethora of information about songs such as how “danceable” or “acoustic” a song is. Each characteristic is measured on a scale of 0 to 1.





For the first iteration of Andrew’s analysis, he simply compared a given song’s streaming performance to the album’s median streaming performance. The hypothesis here is that the less-streamed songs are considered the underrated songs. The result of this analysis was a lot of Taylor’s deluxe tracks.





<figure class="wp-block-image size-full"><figcaption class="wp-element-caption">Source: Genius</figcaption></figure>



The second iteration was to look beyond the streaming performance of the album the song is on. Andrew compared the song’s performance relative to album’s released before and after the current album. This surfaced some more underrated songs.





Getting the opinion of Swifties





While Andrew’s analysis so far yielded some interesting songs, he found that these songs weren’t all that loved by other Swifties.





<figure class="wp-block-image size-full"></figure>



In his final iteration, Andrew implemented a quality score to his analysis. This is a more subjective number that would take into account the opinion of experts.





At Rolling Stone, they had a rolling list of expert opinions that were published in various places. He had a data set of 1,000 opinions on different Taylor Swift songs that he could use to qualify a song. The big question is, how much weight do you give the quality score? In the end, Andrew decided on a weight od 33% to each metric he tracked:






  1. Percent difference between its lifetime Spotify streams and the median streams of its album




  2. Percent difference between its lifetime Spotify streams and the median streams, including neighboring albums




  3. Average of six rankings of Taylor’s discography from media publications (quality score)





The quality score basically took into account the wisdom of the Swifty community.





<figure class="wp-block-image size-full"><figcaption class="wp-element-caption">Source: Know Your Meme</figcaption></figure>



Getting to the #1 most underrated song: Holy Ground (Red)





Andrew was able to use R–a tool he’s already using every day on his job–to do this analysis. After dumping all the data from the Spotify API into a CSV, he used the Tidyverse R packages do crunch the numbers. One of the most commonly used packages for data visualization in Tidyverse is ggplot. But superimposing the images of Taylor Swift’s albums onto the charts created by ggplot was a new script Andrew had to write in R. I asked Andrew if he had to learn any new skills for this Taylor Swift analysis, and the main skill Andrew said he had to learn was data visualization. Here’s an example of a visual from Andrew’s blog post for the #1 most underrated Taylor Swift song:





<figure class="wp-block-image size-full"><figcaption class="wp-element-caption">Source: Republic Records / Tidyverse / Andrew Firriolo / BuzzFeed</figcaption></figure>



To make sure he was on the right track, Andrew asked other Swifties what their #1 most underrated Taylor Swift song was. To Andrew’s delight, two co-workers said Holy Ground. Getting this qualitative feedback let Andrew know he was on the right track.





On the Buzzfeed article, half of the commenters agree that Holy Ground is indeed the most underrated song. The other half talk about other songs that should on the list. When Andrew posted his analysis on LinkedIn, most people commented on his methodology and thought process (like we did in this episode).





Using science to see which re-releases of Taylor’s songs most resemble the original song





Of course, “science” is used a bit loosely here. But similar to Andrew’s underrated song analysis, this analysis utilized the Spotify API to see which Taylor’s Version song most closely matches the original song. This was Andrew’s first analysis on Taylor Swift published late last year.





Read the Buzzfeed article for the full details on the meth

Comments 
loading
In Channel
loading
00:00
00:00
1.0x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Dear Analyst #125: How to identify Taylor Swift’s most underrated songs using data with Andrew Firriolo

Dear Analyst #125: How to identify Taylor Swift’s most underrated songs using data with Andrew Firriolo

Dear Analyst