• This post was written by Vanessa Sochat, a Research Software Engineer for Stanford Research Computing and the Stanford School of Medicine. She is the primary developer at Stanford for Singularity, a driver to bring Research Applications support to Stanford, and lead developer of Singularity Hub and Singularity Registry, both frameworks optimized for deployment of container-based workflows and "science as a service" capabilities. This piece was originally posted on her personal blog. Data sharing is hard, but we all know that ...
  • Let me begin by introducing myself: My name is Martin. I'm an astrophysics postdoc working on understanding exploding stars in nearby galaxies. From the very beginning of my studies, I was using data analysis to try to unveil the mysteries of the universe. From deep images taken with ground- and space-based telescopes, through time series measuring the heartbeats of extreme stars, to population correlations probing the fundamental physics behind incredibly powerful eruptions: learning the secrets of a complex cosmos requires ...
  • Welcome to Kaggle Data Notes! Enjoy these new, intriguing, and overlooked datasets and kernels: 1. ⚽ Predict the World Cup 2018 Winner (link) 2. 🐍 R vs Python Usage from Developer Survey Results (link) 3. Breast Cancer Analysis and Diagnosis (link) 4. 🎤 Generate Kanye West Lyrics using Markov Chains (link) 5. 📚 NLP with Ethereum Developer Survey Data (link) 6. 💰 Analysis of GitHub’s Corporate Acquisition (link) 7. 🔪 Criminal Complaints in New York City (link) 8. 🏥 Kaggle Dataset #1: Medical Costs (link) 9. 🤹 Kaggle ...
  • As Kaggle’s moderating data scientist for the Data Science Bowl, I’m fortunate to have met first-time competitor Nicole Finnie. Her team (Unet Nuke) impressively ranked within the top 2%, earning Nicole a silver medal. More impressively, I learned that Nicole had no ML/DS experience just a year ago, and picked up these new skills through online classes during her recent maternity leave. As an expectant mother, I found Nicole’s story inspiring and am excited to share it with the broader ...
  • We have a new #1 on our leaderboard – a competitor who surprisingly joined the platform just two years ago. Shubin Dai, better known as Bestfitting on Kaggle or Bingo by his friends, is a data scientist and engineering manager living in Changsha, China. He currently leads a company he founded that provides software solutions to banks. Outside of work, and off Kaggle, Dai’s an avid mountain biker and enjoys spending time in nature. Here’s Bestfitting: Can you tell us ...
  • We’re building Kaggle into a platform where you can collaboratively create all of your data science projects. This past quarter, we’ve increased the breadth and scope of work you can build on our platform by launching many new features and expanding computational resources. It is now possible for you to load private datasets you’re working with, develop complex analyses on them in our cloud-based data science environment, and share the project with collaborators in a reproducible way. Upload private datasets ...
  • This is a guest post written by Indra den Bakker, a Kaggle Competition Master and part of a team that achieved 5th position in the 'Planet: Understanding the Amazon from Space' competition. In this post, he shares the journey from Kaggle competition winner to start-up founder focused on tracking deforestation and other forest management insights. Back in the day, during my studies, I was introduced to Kaggle. For the course ‘Data Mining Techniques’ at VU University Amsterdam we had to compete in the ...
  • Data science is an exciting and nebulous field without one clear pathway to success. This infographic series shares the personal stories of individuals who've taken untraditional paths to successful careers in the industry. Find out how Jesse Mostipak went from a Girl Scout camp counselor to a professional data scientist for the Girl Scouts!
  • Kaggle Datasets API Tutorial: Have you used Kaggle's beta API to download data or make a competition submission? We're pleased to announce version 1.1 of the API, which includes new features for easily managing your datasets on Kaggle from the command line. Read on to learn how to use the API to create and update datasets or check out detailed documentation on our GitHub page. Create new datasets » After you follow the installation instructions, it's simple to create a new dataset on Kaggle ...