TL, DR Sometimes you need to calculate averages, minimums, or other metrics from numeric-only columns in a Pandas DataFrame. Here a few snippets to do it. Select numeric columns and calculate metrics You often want to calculate summary statistics like the mean , median , or standard deviation — but you only care about numeric… Continue reading Calculate metrics on numeric columns only for Pandas DataFrames
Author: randomds
Use rsdmx to access data
TL, DR rsdmx is a great R package to access data via SDMX. Sometimes the configuration of parameters to access the data may not be intuitive. Here a quick note from few hours spent trying to access data from ISTAT. Install and import rsdmx This is easily achieved opening R or Rstudio and calling: For… Continue reading Use rsdmx to access data
Fix GSConnect not mounting Android phone in the file system
TL, DR GSConnect is a great open source app to connect your phone to your Linux PC. At the time of writing, when you try to mount the phone file system on your PC, you are most likely to get an error. Here a few steps to fix this issue. Mounting your phone from GSConnect… Continue reading Fix GSConnect not mounting Android phone in the file system
Fix Flatpak browser not opening links in Ubuntu
TL, DR Sometimes Ubuntu stages a coup, setting the Snap internet browser as default. This may break your setup if you actually instralled your browser using Flatpak. Here a few steps to fix this issue. Flatpak vs. Snap Ubuntu has a strong preference for Snaps…but maybe you don’t necessarily agree with this and you prefer… Continue reading Fix Flatpak browser not opening links in Ubuntu
Use Python Selenium with Snap browsers
TL, DR Selenium is one of the main libraries for browser automation and web scraping. Sometimes it is painful to integrate with browsers that are installed as Snap packages, this guide gives you a few examples for a correct configuration. What are Snap packages? Snap packages is the new and preferred way to distribute applications… Continue reading Use Python Selenium with Snap browsers
Set your User Agent with Python Requests, Scrapy, and Selenium
TL, DR When you crawl the web to collect data you should set a User Agent that identifies you. Or one that hides the tool you are using. Here you can find how to set the User Agent in Python Requests, Scrapy, and Selenium. What is the User Agent? A User Agent is a string… Continue reading Set your User Agent with Python Requests, Scrapy, and Selenium
Deploy Minio with Docker on a Synology NAS with a custom domain name
TL, DR Minio is a powerful open source self-hosted object storage, similar to S3, that you can deploy on your own hardware or cloud instance. This guide provide you the right settings to deploy Minio with Dover on a Synology NAS and use your domain name with SSL Minio object storage We already covered Minio… Continue reading Deploy Minio with Docker on a Synology NAS with a custom domain name
MongoDB aggregation: save results in a Pandas DataFrame
TL, DR MongoDB is one of the leading NoSQL databases, and its aggregation framework enables powerful queries, as well as data operations. We will see how to save results from aggregation pipelines into a Pandas DataFrame. From MongoDB to Pandas I already provided an introduction to MongoDB and Compass in a previous post for my… Continue reading MongoDB aggregation: save results in a Pandas DataFrame
MongoDB aggregation: match a field with values in a list
TL, DR MongoDB is one of the leading NoSQL databases, and its aggregation framework enables powerful queries, as well as data operations. We will see how to match a field with values in a list to help you select. Matching values in a list I already provided an introduction to MongoDB and Compass in a… Continue reading MongoDB aggregation: match a field with values in a list
MongoDB aggregation: create your data pipelines
TL, DR MongoDB is one of the leading NoSQL databases, and its aggregation framework enables powerful queries, as well as data operations. We will see how to quickly create data pipelines and create code you can use in your projects. The power of a Compass You use MongoDB as data repository because it’s so simple… Continue reading MongoDB aggregation: create your data pipelines