Veille de CCM Benchmark Group

Building and deploying MySQL Raft at Meta

  We’re rolling out MySQL Raft with the aim to eventually replace our current MySQL semisynchronous databases.  The biggest win of MySQL Raft was simplification of the operation and making MySQL servers take care of promotions and membership. This gave th

<engineering.fb.com>
By dam75 - - (Open) Data
ScrapeOps - The DevOps Tool For Web Scraping. | ScrapeOps

ScrapeOps is a devops tool for web scraping which enables you to easily monitor, analyse and schedule your scraping jobs.

State of web scraping 2022 <scrapeops.io> Et le form pour 2023 <docs.google.com>
By xavierccm - - (Open) Data
State of web scraping 2022 <ht

State of web scraping 2022 <scrapeops.io> Et le form pour 2023 <docs.google.com>
By xavierccm - - (Open) Data
Comment Activision a scalé My

Comment Activision a scalé MySQL (avec Vitess) <static.sched.com>
By dam75 - - (Open) Data
dbt - Transform data in your warehouse

dbt is a data transformation tool that enables data analysts and engineers to transform, test and document data in the cloud data warehouse.

<www.getdbt.com> : transfo / versionning / de data : une sorte d'ELT mais code-oriented. Il est de plus en plus utilisé, en remplacement notamment de Talend (le glisser-déplacer est moins à la mode :wink: )
By dam75 - - (Open) Data
GitHub - ArroyoSystems/arroyo: Arroyo is a distributed stream processing engine written in Rust

Arroyo is a distributed stream processing engine written in Rust - GitHub - ArroyoSystems/arroyo: Arroyo is a distributed stream processing engine written in Rust

<github.com>
By xavierccm - - (Open) Data
OneSchema CSV Importer | The embeddable CSV Importer for Developers

Product and engineering teams use OneSchema to save months of development time building a CSV importer. OneSchema improves customer activation / import completion rates by automatically correcting customer data.

<www.oneschema.co> Import CSV data 10x faster, outil dédié à de l'import de CSV et correction d'erreurs
By xavierccm - - (Open) Data
Your wiki, docs & projects. Together.

A new tool that blends your everyday work apps into one. It's the all-in-one workspace for you and your team.

Notion.so se met également à l'IA en menu contextuel en proposant des améliorations d'un texte ou génération d'un nouveau :
By gmarchault - - (Open) Data
Baserow et NocoDB, deux alternatives open source à Airtable

Le modèle exclusivement SaaS d'Airtable et ses limites techniques peuvent conduire à choisir des alternatives open source. Dans ce domaine, deux solutions ressortent sur le marché.

On n'est jamais mieux servis que par soi-même donc : <www.journaldunet.com>
By dam75 - - (Open) Data
CloudQuery | An open source high performance data integration platform for developers | CloudQuery

Sync any source to any database, transform and visualize

Un ELT (et non pas ETL ^^) open-source (en GO) <www.cloudquery.io>
By dam75 - - (Open) Data
A hassle-free web scraper to extract without getting blocked

The easiest web scraper with proxy rotation, infinite pagination, real browsers using JavaScript, and a built in scheduler.

<mrscraper.com>
By dam75 - - (Open) Data
Le palmarès des communes d'Île-de-France où vivre et vieillir en bonne santé

EXCLUSIF - Espérance de vie, prévalence de maladies graves et de cancers, qualité de l'air, densité de médecins, pratique sportive… Le Figaro a comparé les données des 1268 communes d'Île-de-France pour trouver celles où l'on vit en meilleure s

<www.lefigaro.fr>
By cvince - - (Open) Data
Extracting, converting, and querying data in local files using clickhouse-local

Learn how you can use clickhouse-local to analyze and transform your local and remote files using just the power of SQL on your laptop

<clickhouse.com> Clickhouse local, un outil dédié à l'analyse de CSV à l'aide de SQL. Ça n'est pas le premier outil du genre mais il semble particulièrement simple à exploiter et plutôt perfo
By xavierccm - - (Open) Data
5 PHP scraping libraries in 2023.

Web Scraping is evolving every day. There are new methods and new libraries every day for making web scraping fun and easy. First, let’s…

<scrapingking.medium.com>
By vleborgne - - (Open) Data
Home

Kùzu is an embeddable property graph database management system designed for performance, scalability, and ease of use in applications.

<kuzudb.com>
By bmeli - - (Open) Data
How many yottabytes in a quettabyte? Extreme numbers get new names

Prolific generation of data drove the need for prefixes that denote 1027 and 1030.

<www.nature.com>
By bmeli - - (Open) Data
24 Hours of Global Air Traffic

Explore all the air traffic that happens around the world in a single day.

<24h-global-air-traffic.cbergillos.com>
By bmeli - - (Open) Data
Ouyapacours

Où manque-t-il des professeurs ? Cherchez votre département / commune sur <ouyapacours.fcpe.asso.fr>
By cvince - - (Open) Data