Scrapy: Select elements containing a specific text

TL, DR Scrapy is one of the top frameworks for web scraping in Python. Often you need to select some element where you already know part of its text. This post shows you how to select elements containing a certain text with CSS and XPATH selectors. How to select elements containing a certain text in… Continue reading Scrapy: Select elements containing a specific text

Collaborative edit for Jupyter Notebooks

TL, DR Jupyter Notebooks are one of the most common tools for data scientist to explore new challenges and prototype solutions. In most cases, this process is a collaborative one. And since a couple of years, it would be quite common that you will interact with your co-workers remotely. Collaborative edit capabilities for Jupyter Notebooks… Continue reading Collaborative edit for Jupyter Notebooks

Import NA strings in Pandas the right way

TL, DR Pandas has great parsing tools to let you import data from multiple file formats (CSV, Excel, etc…). Sometimes in your data you have “NA” strings that are valid values. However, Pandas default will consider those strings as markers for missing values. Below some option you can use to prevent this default and import… Continue reading Import NA strings in Pandas the right way

Google Foobar Challenge level 5 – Expanding Nebula

TL, DR Google Foobar is a hidden coding challenge by Google, and Expanding Nebula is the challenge I solved using Python in order to complete level 5. This post is part of a series on Google Foobar, with all the challenges I encountered. The full series is available here. Getting ready to run The ninth… Continue reading Google Foobar Challenge level 5 – Expanding Nebula