| Message ID | 20241220112613.22647-1-stefan.herbrechtsmeier-oss@weidmueller.com |
|---|---|
| Series | Concept for tightly coupled package manager (Node.js, Go, Rust) |
On Fri, 2024-12-20 at 12:25 +0100, Stefan Herbrechtsmeier via lists.openembedded.org wrote: > From: Stefan Herbrechtsmeier <stefan.herbrechtsmeier@weidmueller.com> > > The patch series improves the fetcher support for tightly coupled > package manager (npm, go and cargo). It adds support for embedded > dependency fetcher via a common dependency mixin. The patch series > reworks the npm-shrinkwrap.json (package-lock.json) support and adds a > fetcher for go.sum and cargo.lock files. The dependency mixin contains > two stages. The first stage locates a local specification file or > fetches an archive or git repository with a specification file. The > second stage resolves the dependency URLs from the specification file > and fetches the dependencies. > > SRC_URI = "<type>://npm-shrinkwrap.json" > SRC_URI = "<type>+http://example.com/ npm-shrinkwrap.json" > SRC_URI = "<type>+http://example.com/${BP}.tar.gz;striplevel=1;subdir=${BP}" > SRC_URI = "<type>+git://example.com/${BPN}.git;protocol=https" > > Additionally, the patch series reworks the npm fetcher to work without a > npm binary and external package repository. It adds support for a common > dependency name and version schema to integrate the dependencies into > the SBOM. This certainly sounds promising, thanks for working on it. It will take me a bit of time to digest the changes. A while back I was asked to document the constraints the fetchers operate within and I documented this here: https://git.yoctoproject.org/poky/tree/bitbake/lib/bb/fetch2/README Would you be able to check if this work meets the criteria set out there and if not, what the differences are? Thanks, Richard
On Mon, 23 Dec 2024 at 11:03, Richard Purdie via lists.openembedded.org <richard.purdie=linuxfoundation.org@lists.openembedded.org> wrote: > Would you be able to check if this work meets the criteria set out > there and if not, what the differences are? I'd also add that this would benefit from a demonstration with one of the real go/rust recipes in oe-core: basically it would be good to push a branch of poky somewhere public, and provide instructions on how to see the new fetchers in action, and observe their benefits. Alex
Am 23.12.2024 um 11:03 schrieb Richard Purdie via lists.openembedded.org: > On Fri, 2024-12-20 at 12:25 +0100, Stefan Herbrechtsmeier via lists.openembedded.org wrote: >> From: Stefan Herbrechtsmeier <stefan.herbrechtsmeier@weidmueller.com> >> >> The patch series improves the fetcher support for tightly coupled >> package manager (npm, go and cargo). It adds support for embedded >> dependency fetcher via a common dependency mixin. The patch series >> reworks the npm-shrinkwrap.json (package-lock.json) support and adds a >> fetcher for go.sum and cargo.lock files. The dependency mixin contains >> two stages. The first stage locates a local specification file or >> fetches an archive or git repository with a specification file. The >> second stage resolves the dependency URLs from the specification file >> and fetches the dependencies. >> >> SRC_URI = "<type>://npm-shrinkwrap.json" >> SRC_URI = "<type>+http://example.com/ npm-shrinkwrap.json" >> SRC_URI = "<type>+http://example.com/${BP}.tar.gz;striplevel=1;subdir=${BP}" >> SRC_URI = "<type>+git://example.com/${BPN}.git;protocol=https" >> >> Additionally, the patch series reworks the npm fetcher to work without a >> npm binary and external package repository. It adds support for a common >> dependency name and version schema to integrate the dependencies into >> the SBOM. > This certainly sounds promising, thanks for working on it. It will take > me a bit of time to digest the changes. > > A while back I was asked to document the constraints the fetchers > operate within and I documented this here: > > https://git.yoctoproject.org/poky/tree/bitbake/lib/bb/fetch2/README > > Would you be able to check if this work meets the criteria set out > there and if not, what the differences are? The fetchers inherit existing fetchers and reuse their functionality. The npm fetcher inherits the wget fetcher and only overrides the urldata_init and latest_versionstring functions. 
The reworked urldata_init function preprocesses the URL and works without internet access. The dependency mixin is inspired by the gitsm fetcher. The cargolock, gosum and npmsw fetchers inherit the local, wget and git fetchers. They forward the function calls to the parent class, process the dependency specification file and handle the dependencies if needed. Thereby the content of the specification file is translated into source URLs for existing fetchers and saved inside a proxy object. The user has to call the download function to download the main source with the specification file and all dependencies. The dependencies are downloaded via existing fetchers. Because of the reuse of existing fetchers, all criteria should either be satisfied by the new fetchers or need to be fixed inside the existing fetchers.
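As an aside for readers following along, the two-stage mixin pattern described in this reply could be sketched roughly as below. This is a minimal illustration, not the actual patch-series code: the class names (WgetFetcher, DependencyMixin, NpmswFetcher), the spec structure and the proxy attribute are all assumed stand-ins.

```python
class WgetFetcher:
    """Stand-in for an existing bitbake fetcher (wget, git, local)."""
    def __init__(self):
        self.fetched = []

    def download(self, url):
        # A real fetcher would download into DL_DIR; we just record it.
        self.fetched.append(url)

class DependencyMixin:
    """Stage 1: fetch the source carrying the specification file.
    Stage 2: translate its entries into source URLs and fetch them
    through the parent fetcher."""
    def parse_spec(self, spec):
        # Translate lock-file entries into URLs for existing fetchers.
        return [dep["url"] for dep in spec["dependencies"]]

    def download(self, url, spec):
        super().download(url)               # stage 1: main source
        self.proxy = self.parse_spec(spec)  # translated dependency URLs
        for dep_url in self.proxy:          # stage 2: dependencies
            super().download(dep_url)

class NpmswFetcher(DependencyMixin, WgetFetcher):
    """npmsw-style fetcher: the mixin layered over the wget fetcher."""
    pass

# Trivial spec resembling one npm-shrinkwrap.json dependency entry.
spec = {"dependencies": [
    {"name": "glob", "version": "10.3.15",
     "url": "https://registry.npmjs.org/glob/-/glob-10.3.15.tgz"},
]}
f = NpmswFetcher()
f.download("https://example.com/pkg-1.0.tar.gz", spec)
```

The point of the mixin layering is that every call ultimately lands in an existing fetcher, which is the basis for the claim that the fetch2 README criteria are inherited from the parent classes.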
On Thu, 2025-01-02 at 09:55 +0100, Stefan Herbrechtsmeier wrote: > Am 23.12.2024 um 11:03 schrieb Richard Purdie via lists.openembedded.org: > > On Fri, 2024-12-20 at 12:25 +0100, Stefan Herbrechtsmeier via lists.openembedded.org wrote: > > > From: Stefan Herbrechtsmeier <stefan.herbrechtsmeier@weidmueller.com> > > > > > > The patch series improves the fetcher support for tightly coupled > > > package manager (npm, go and cargo). It adds support for embedded > > > dependency fetcher via a common dependency mixin. The patch series > > > reworks the npm-shrinkwrap.json (package-lock.json) support and adds a > > > fetcher for go.sum and cargo.lock files. The dependency mixin contains > > > two stages. The first stage locates a local specification file or > > > fetches an archive or git repository with a specification file. The > > > second stage resolves the dependency URLs from the specification file > > > and fetches the dependencies. > > > > > > SRC_URI = "<type>://npm-shrinkwrap.json" > > > SRC_URI = "<type>+http://example.com/ npm-shrinkwrap.json" > > > SRC_URI = "<type>+http://example.com/${BP}.tar.gz;striplevel=1;subdir=${BP}" > > > SRC_URI = "<type>+git://example.com/${BPN}.git;protocol=https" > > > > > > Additionally, the patch series reworks the npm fetcher to work without a > > > npm binary and external package repository. It adds support for a common > > > dependency name and version schema to integrate the dependencies into > > > the SBOM. > > This certainly sounds promising, thanks for working on it. It will take > > me a bit of time to digest the changes. > > > > A while back I was asked to document the constraints the fetchers > > operate within and I documented this here: > > > > https://git.yoctoproject.org/poky/tree/bitbake/lib/bb/fetch2/README > > > > Would you be able to check if this work meets the criteria set out > > there and if not, what the differences are? > > The fetchers inheritance existing fetchers and reuse existent > functionality. 
The npm fetcher inheritance the wget fetcher and only > override the urldata_init and latest_versionstring function. The > reworked urldata_init function preprocess the url and works without > internet access. The dependency mixin is inspired by the gitsm fetcher. > The cargolock, gosum and npmsw fetcher inherit the local, wget and git > fetcher. They forward the function calls to the parent class, process > the dependency specification file and handle the dependencies if needed. > Thereby the content of the specification file is translated into source > urls for existing fetchers and saved inside a proxy object. The user has > to call the download function to download the main source with > specification file and all dependencies. The dependencies are downloaded > via existing fetchers. > > Because of the reuse of existing fetchers all criteria should be > satisfied by the new fetcher or need to be fixed inside the existing > fetchers. Even if you forward everything to the parent API, there are ways you could use it such that the parent class meets the criteria but the derived one does not. I'm trying to aid the review process by asking those questions; it will just take longer if I have to work this out myself. The other question I'm wondering about is compatibility and how we change the way URLs are working. Do these changes need a flag day where recipes need to be updated to match? If so, how do we best handle that? Is the user going to get errors they can easily fix, or how is that going to work? Cheers, Richard
Am 02.01.2025 um 10:32 schrieb Richard Purdie: > On Thu, 2025-01-02 at 09:55 +0100, Stefan Herbrechtsmeier wrote: >> Am 23.12.2024 um 11:03 schrieb Richard Purdie via lists.openembedded.org: >>> On Fri, 2024-12-20 at 12:25 +0100, Stefan Herbrechtsmeier via lists.openembedded.org wrote: >>>> From: Stefan Herbrechtsmeier <stefan.herbrechtsmeier@weidmueller.com> >>>> >>>> The patch series improves the fetcher support for tightly coupled >>>> package manager (npm, go and cargo). It adds support for embedded >>>> dependency fetcher via a common dependency mixin. The patch series >>>> reworks the npm-shrinkwrap.json (package-lock.json) support and adds a >>>> fetcher for go.sum and cargo.lock files. The dependency mixin contains >>>> two stages. The first stage locates a local specification file or >>>> fetches an archive or git repository with a specification file. The >>>> second stage resolves the dependency URLs from the specification file >>>> and fetches the dependencies. >>>> >>>> SRC_URI = "<type>://npm-shrinkwrap.json" >>>> SRC_URI = "<type>+http://example.com/ npm-shrinkwrap.json" >>>> SRC_URI = "<type>+http://example.com/${BP}.tar.gz;striplevel=1;subdir=${BP}" >>>> SRC_URI = "<type>+git://example.com/${BPN}.git;protocol=https" >>>> >>>> Additionally, the patch series reworks the npm fetcher to work without a >>>> npm binary and external package repository. It adds support for a common >>>> dependency name and version schema to integrate the dependencies into >>>> the SBOM. >>> This certainly sounds promising, thanks for working on it. It will take >>> me a bit of time to digest the changes. >>> >>> A while back I was asked to document the constraints the fetchers >>> operate within and I documented this here: >>> >>> https://git.yoctoproject.org/poky/tree/bitbake/lib/bb/fetch2/README >>> >>> Would you be able to check if this work meets the criteria set out >>> there and if not, what the differences are? 
>> The fetchers inheritance existing fetchers and reuse existent >> functionality. The npm fetcher inheritance the wget fetcher and only >> override the urldata_init and latest_versionstring function. The >> reworked urldata_init function preprocess the url and works without >> internet access. The dependency mixin is inspired by the gitsm fetcher. >> The cargolock, gosum and npmsw fetcher inherit the local, wget and git >> fetcher. They forward the function calls to the parent class, process >> the dependency specification file and handle the dependencies if needed. >> Thereby the content of the specification file is translated into source >> urls for existing fetchers and saved inside a proxy object. The user has >> to call the download function to download the main source with >> specification file and all dependencies. The dependencies are downloaded >> via existing fetchers. >> >> Because of the reuse of existing fetchers all criteria should be >> satisfied by the new fetcher or need to be fixed inside the existing >> fetchers. > Even if you forward everything to the parent API, there are ways you > could use it such that the parent class meets the criteria but the > dervived one does not. I'm trying to aid the review process by asking > those questions, it will just take longer if I have to work this out > myself. I don't see a reason why the fetchers shouldn't meet the constraints; they were designed to fulfill them. > The other question I'm wondering about is compatibility and how we > change the way urls are working. Do these changes need a flag day where > recipes need to be updated to match? If so, how do we best handle that? > Is the user going to get errors they can easily fix or how is that > going to work? The fetchers should be backward compatible for recipes. I have added warnings that propose the desired changes: Parameter 'package' in '<url>' is deprecated. Please use 'dn' parameter instead. 
Parameter 'version' in '<url>' is deprecated. Please use 'dv' parameter instead. If we have an agreement about a common schema for repository host, package name and package version, I will update the crate and gomod fetchers in a backward compatible way too. Hopefully most users will switch to the new fetchers instead of updating their own tools to generate recipes and include files. The only desired incompatible changes are the removal of the old npm-shrinkwrap.json format and of the support for a "latest" version in the npm fetcher. All upstream supported npm versions support the new format, and the user can update the package lock file via npm. Like AUTOREV, the "latest" version for npm leads to many problems, and its usefulness should be very low. I will leave the functions which are used by oe-core in the npm and npmsw fetchers. They should be moved into oe-core and afterwards removed from the fetchers.
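For illustration, the backward-compatible parameter renaming described here ('package' becoming 'dn', 'version' becoming 'dv') could look roughly like this. The function name normalize_params and the RENAMED table are assumptions for the sketch; only the warning text mirrors the messages quoted in the thread.

```python
import warnings

# Old SRC_URI parameter names mapped to the proposed common schema.
RENAMED = {"package": "dn", "version": "dv"}

def normalize_params(url, params):
    """Accept deprecated parameter names but warn with the proposed fix."""
    out = {}
    for key, value in params.items():
        new = RENAMED.get(key, key)
        if new != key:
            warnings.warn(
                f"Parameter '{key}' in '{url}' is deprecated. "
                f"Please use '{new}' parameter instead.")
        out[new] = value
    return out

# Old-style parameters are accepted and translated, with a warning.
params = normalize_params("npm://registry.npmjs.org/",
                          {"package": "glob", "version": "10.3.15"})
```

Existing recipes keep working while the warning nudges users toward the new dn/dv names, which is the flag-day-avoiding behaviour described above.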
Am 02.01.2025 um 10:32 schrieb Richard Purdie: > On Thu, 2025-01-02 at 09:55 +0100, Stefan Herbrechtsmeier wrote: >> Am 23.12.2024 um 11:03 schrieb Richard Purdie via lists.openembedded.org: >>> On Fri, 2024-12-20 at 12:25 +0100, Stefan Herbrechtsmeier via lists.openembedded.org wrote: >>>> From: Stefan Herbrechtsmeier<stefan.herbrechtsmeier@weidmueller.com> >>>> >>>> The patch series improves the fetcher support for tightly coupled >>>> package manager (npm, go and cargo). It adds support for embedded >>>> dependency fetcher via a common dependency mixin. The patch series >>>> reworks the npm-shrinkwrap.json (package-lock.json) support and adds a >>>> fetcher for go.sum and cargo.lock files. The dependency mixin contains >>>> two stages. The first stage locates a local specification file or >>>> fetches an archive or git repository with a specification file. The >>>> second stage resolves the dependency URLs from the specification file >>>> and fetches the dependencies. >>>> >>>> SRC_URI = "<type>://npm-shrinkwrap.json" >>>> SRC_URI = "<type>+http://example.com/ npm-shrinkwrap.json" >>>> SRC_URI = "<type>+http://example.com/${BP}.tar.gz;striplevel=1;subdir=${BP}" >>>> SRC_URI = "<type>+git://example.com/${BPN}.git;protocol=https" >>>> >>>> Additionally, the patch series reworks the npm fetcher to work without a >>>> npm binary and external package repository. It adds support for a common >>>> dependency name and version schema to integrate the dependencies into >>>> the SBOM. [SNIP] > I'm trying to aid the review process by asking > those questions, it will just take longer if I have to work this out > myself. Maybe we are able to discuss some design decisions without code to simplify the review: The dependency fetcher needs to know the path of the dependency specification file. In the case of the local fetcher, the path is the URI. In the case of the git fetcher, the path depends on the subdir and destsuffix parameters. In the case of the wget fetcher, the path is unknown. 
This series requires the parameters striplevel=1 and subdir=${BP} to work. Additionally, it doesn't support specification files inside subdirectories. Therefore I plan to add a srcdir parameter. Should this parameter be mandatory for the wget fetcher, or should the fetcher use the PN or S variable to determine a default value?
On Thu, 2025-01-02 at 14:50 +0100, Stefan Herbrechtsmeier wrote: > > > > I'm trying to aid the review process by asking > > those questions, it will just take longer if I have to work this > > out > > myself. > > > Maybe we are able to discuss some design decision without code to > simplify the review: > > The dependency fetcher need to know the path of the dependency > specification file. In case of the local fetcher the path is the uri. > In case of the git fetcher the path depends on the subdir and > destsuffix parameter. In case of the wget the path is unknown. This > series requires the parameters striplevel=1 and subdir=${BP} to work. > Additionally it doesn't support specification files inside sub > directories. Therefore I plan to add a srcdir parameter. Should this > parameter be mandatory for the wget fetcher or should the fetcher use > the PN or S variable to determine a default value? do_fetch never touches ${S}, it would only touch ${DL_DIR}. For that reason, ${S} is passed as a parameter to do_unpack and is only referenced at that time. wget shouldn't need more information to have a default it already has in the current code so something isn't adding up. I'm still not sure why you'd need both a subdir and srcdir but I think I need to think about this more deeply, FWIW I'm technically still on vacation. Cheers, Richard
Am 02.01.2025 um 15:07 schrieb Richard Purdie: > On Thu, 2025-01-02 at 14:50 +0100, Stefan Herbrechtsmeier wrote: >>> I'm trying to aid the review process by asking >>> those questions, it will just take longer if I have to work this >>> out >>> myself. >>> >> Maybe we are able to discuss some design decision without code to >> simplify the review: >> >> The dependency fetcher need to know the path of the dependency >> specification file. In case of the local fetcher the path is the uri. >> In case of the git fetcher the path depends on the subdir and >> destsuffix parameter. In case of the wget the path is unknown. This >> series requires the parameters striplevel=1 and subdir=${BP} to work. >> Additionally it doesn't support specification files inside sub >> directories. Therefore I plan to add a srcdir parameter. Should this >> parameter be mandatory for the wget fetcher or should the fetcher use >> the PN or S variable to determine a default value? > do_fetch never touches ${S}, it would only touch ${DL_DIR}. For that > reason, ${S} is passed as a parameter to do_unpack and is only > referenced at that time. do_unpack uses ${UNPACKDIR} and not ${S}. The ${S} variable points to the main folder of the source and in most cases contains the main folder of the archive. > wget shouldn't need more information to have a default it already has > in the current code so something isn't adding up. The additional information is needed by the dependency resolution, not by wget. The dependency resolution needs to temporarily unpack the archive to read the dependency specification file inside the archive. > I'm still not sure why you'd need both a subdir and srcdir but I think > I need to think about this more deeply, FWIW I'm technically still on > vacation. The subdir parameter is used to place the archive content inside an arbitrary folder. This is only needed if you need to place one source into another source. 
The srcdir parameter is needed to know the path of the specification file inside the archive. The main folder inside an archive is archive specific. Therefore the S variable uses a common default (${WORKDIR}/${BP}), and it is common to override the variable because the archive uses another folder name. Additionally, the specification file could be located inside a subfolder of the archive. librsvg: SRC_URI = "cargolock+${GNOME_MIRROR}/${GNOMEBN}/${@gnome_verdir("${PV}")}/${GNOMEBN}-${PV}.tar.${GNOME_COMPRESS_TYPE};name=archive;srcdir=${BP}" --> librsvg-2.58.2/Cargo.lock python3-bcrypt: SRC_URI = "cargolock+${@pypi_src_uri(d)};srcdir=${PYPI_PACKAGE}-${PV}/${CARGO_SRC_DIR}" S = "${WORKDIR}/${PYPI_PACKAGE}-${PV}" CARGO_SRC_DIR = "src/_bcrypt" --> bcrypt-4.2.1/src/_bcrypt/Cargo.lock Maybe srcdir should be named specdir, or specsuffix if its default value is ${BN}.
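The path resolution behind the proposed srcdir parameter can be illustrated with a small sketch matching the librsvg and python3-bcrypt examples above. The function spec_path and its fallback to a ${BP}-style default are assumptions for illustration, not code from the series.

```python
import os

def spec_path(unpackdir, spec_name, srcdir=None, bp="pkg-1.0"):
    """Return the path of the dependency specification file inside a
    temporarily unpacked archive.

    If no srcdir parameter is given, fall back to the common ${BP}
    top-level folder that most release archives use."""
    subdir = srcdir if srcdir is not None else bp
    return os.path.join(unpackdir, subdir, spec_name)

# librsvg-style: spec file in the archive's top-level folder.
p1 = spec_path("/tmp/unpack", "Cargo.lock", srcdir="librsvg-2.58.2")

# python3-bcrypt-style: spec file inside a subfolder of the archive.
p2 = spec_path("/tmp/unpack", "Cargo.lock",
               srcdir="bcrypt-4.2.1/src/_bcrypt")
```

The open design question in the thread is exactly which default (PN, S, or none at all) should stand in for the `bp` fallback here.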
On Fri, 2024-12-20 at 12:25 +0100, Stefan Herbrechtsmeier via lists.openembedded.org wrote: > From: Stefan Herbrechtsmeier <stefan.herbrechtsmeier@weidmueller.com> > > The patch series improves the fetcher support for tightly coupled > package manager (npm, go and cargo). It adds support for embedded > dependency fetcher via a common dependency mixin. The patch series > reworks the npm-shrinkwrap.json (package-lock.json) support and adds a > fetcher for go.sum and cargo.lock files. The dependency mixin contains > two stages. The first stage locates a local specification file or > fetches an archive or git repository with a specification file. The > second stage resolves the dependency URLs from the specification file > and fetches the dependencies. > > SRC_URI = "<type>://npm-shrinkwrap.json" > SRC_URI = "<type>+http://example.com/ npm-shrinkwrap.json" > SRC_URI = "<type>+http://example.com/${BP}.tar.gz;striplevel=1;subdir=${BP}" > SRC_URI = "<type>+git://example.com/${BPN}.git;protocol=https" > > Additionally, the patch series reworks the npm fetcher to work without a > npm binary and external package repository. It adds support for a common > dependency name and version schema to integrate the dependencies into > the SBOM. > > = Background > Bitbake has diverse concepts and drawbacks for different tightly coupled > package manager. The Python support uses a recipe per dependency and > generates common fetcher URLs via a python function. The other languages > embed the dependencies inside the recipe. The Node.js support offers a > npmsw fetcher which uses a lock file beside the recipe to generates > multiple common fetcher URLs on the fly and thereby hides the real > download sources. This leads to a single source in the SBOM for example. > The Go support contains two parallel implementations. A vendor-based > solution with a common fetcher and a go-mod-based solution with a gomod > fetcher. 
The vendor-based solution includes the individual dependencies > into the SRC_URI of the recipe and uses a python function to generate > common fetcher URLs which additional information for the vendor task.The > gomod fetcher uses a proprietary gomod URL. It translates the URL into a > common URL and prepares meta data during unpack. The Rust support > includes the individual dependencies in the SRC_URI of the recipe and > uses proprietary crate URLs. The crate fetcher translates a proprietary > URL into a common fetcher URL and prepares meta data during unpack. The > recipetool does not support the crate and the gomod fetcher. This leads > to missing licenses of the dependencies in the recipe for example > librsvg. > > The steps needed to fetch dependencies for Node.js, Go and Rust are > similar: > 1. Extract the dependencies from a specification file (name, version, > checksum and URL) > 2. Generate proprietary fetcher URIs > a. npm://registry.npmjs.org/;package=glob;version= 10.3.15 > b. gomod://golang.org/x/net;version=v0.9.0 > gomodgit://golang.org/x/net;version=v0.9.0;repo=go.googlesource.com/net > c. crate://crates.io/glob/0.3.1 > 3. Generate wget or git fetcher URIs > a. https://registry.npmjs.org/glob/-/glob-10.3.15.tgz;downloadfilename=… > b. https://proxy.golang.org/golang.org/x/net/@v/v0.9.0.zip;downloadfilename=… > git://go.googlesource.com/net;protocol=https; subdir=… > c. https://crates.io/api/v1/crates/glob/0.3.1/download;downloadfilename=… > 4. Unpack > 5. Create meta files > a. Update lockfile and create tar.gz archives > b. Create go.mod file > Create info, go.mod file and zip archives > c. Create .cargo-checksum.json files > > It looks like the recipetool is not widely used and therefore this patch > series integrates the dependency resolving into the fetcher. After an > agreement on a concept the fetcher could be extended. 
The fetcher could > download the license information per package and a new build task could > run the license cruncher from the recipetool. I've spent a bit more time thinking about this and looking at the code and I've mixed feelings on it. I can certainly see why you've implemented it this way and it does have a lot of potential, but there are also potential risks. My comments (on various elements): With a npm-shrinkwrap.json/package-lock.json/go.sum file, are dependencies always recorded as specific entities with checksums? I'm a little bit worried about how easily you could sneak a "floating" version into this and make the fetcher non-deterministic. Does (or could?) the code detect and error on that? Put another way, could one of these SRC_URIs map to multiple different combinations of underlying component versions? Our existing method effectively hardcodes/expands the lock file into extended SRC_URI entries which makes the specific versions and components really clear. This change abstracts that away into the fetcher and makes it opaque to the user, and much harder for code like the archiver/license/spdx code to find/handle. I noticed that any fetcher operation has to first expand the lock file using a temporary directory. You're using DL_DIR for that which I suspect isn't a great idea for tmp files. In many cases that will work fine but it is a bit of a performance overhead. I did start wondering if we should cache the lock files in a subdir of DL_DIR to help performance and also give some extra assurance about changing content. The url scheme is clever but also has a potential risk in that you can't really pass parameters to both the top level fetcher and the underlying one. I'm worried that is going to bite us further down the line. > = Open questions > > * Where should we download dependencies? > ** Should we use a folder per fetcher (ex. git and npm)? > ** Should we use the main folder (ex. crate)? > ** Should we translate the name into folder (ex. gomod)? 
> ** Should we integrate the name into the filename (ex. git)? DL_DIR is meant to be a complete cache of the source so it would need to be downloaded there. Given it maps to the other fetchers, the existing cache mechanisms likely work for these just fine; the open question is whether the lock/spec files should be cached after extraction. > * Where should we unpack the dependencies? > ** Should we use a folder inside the parent folder (ex. node_modules)? > ** Should we use a fixed folder inside unpackdir > (ex. go/pkg/mod/cache/download and cargo_home/bitbake)? This likely depends on the fetcher as the different mechanisms will have different expectations about how they should be extracted (as npm/etc. would). > * How should we treat archives for package manager caches? > ** Should we unpack the archives to support patching (ex. npm)? > ** Should we copy the packed archive to avoid unpacking and packaging > (ex. gomod)? If there are archives left after do_unpack, which task is going to unpack those? Are we expecting the build process in configure/compile to decompress them? Would those management tools accept things if they were extracted earlier? "unpack" would be the correct time to do it but I can see this getting into conflict with the package manager :/. > This patch series depends on patch series > 20241209103158.20833-1-stefan.herbrechtsmeier-oss@weidmueller.com > ("[1/4] tests: fetch: adapt npmsw tests to fixed unpack behavior"). Those merged, thanks. I did wonder if patches 1-5 of this series could be merged separately too, as they look reasonable regardless of the rest of the series? Cheers, Richard
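One possible shape for the lock/spec file caching Richard wonders about above is a DL_DIR subdirectory keyed by content checksum, so repeated fetches reuse the extracted file and changed content is detected by a different key. The layout, directory name and function below are purely illustrative assumptions, not anything from the series.

```python
import hashlib
import os
import tempfile

def cache_spec(dl_dir, data: bytes, name="npm-shrinkwrap.json"):
    """Store the spec file once per content hash and return its path.

    A changed spec file hashes to a different directory, so stale
    cached copies can never be confused with new content."""
    digest = hashlib.sha256(data).hexdigest()
    cache = os.path.join(dl_dir, "spec-cache", digest)
    os.makedirs(cache, exist_ok=True)
    path = os.path.join(cache, name)
    if not os.path.exists(path):  # reuse the cached copy if present
        with open(path, "wb") as f:
            f.write(data)
    return path

# Two fetches of identical content hit the same cached file.
with tempfile.TemporaryDirectory() as dl:
    p = cache_spec(dl, b'{"dependencies": {}}')
    same = cache_spec(dl, b'{"dependencies": {}}')
```

This avoids re-extracting the archive on every fetcher operation while giving the "extra assurance about changing content" mentioned above.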
Am 06.01.2025 um 12:04 schrieb Richard Purdie: > On Fri, 2024-12-20 at 12:25 +0100, Stefan Herbrechtsmeier via lists.openembedded.org wrote: >> From: Stefan Herbrechtsmeier<stefan.herbrechtsmeier@weidmueller.com> >> >> The patch series improves the fetcher support for tightly coupled >> package manager (npm, go and cargo). It adds support for embedded >> dependency fetcher via a common dependency mixin. The patch series >> reworks the npm-shrinkwrap.json (package-lock.json) support and adds a >> fetcher for go.sum and cargo.lock files. The dependency mixin contains >> two stages. The first stage locates a local specification file or >> fetches an archive or git repository with a specification file. The >> second stage resolves the dependency URLs from the specification file >> and fetches the dependencies. >> >> SRC_URI = "<type>://npm-shrinkwrap.json" >> SRC_URI = "<type>+http://example.com/ npm-shrinkwrap.json" >> SRC_URI = "<type>+http://example.com/${BP}.tar.gz;striplevel=1;subdir=${BP}" >> SRC_URI = "<type>+git://example.com/${BPN}.git;protocol=https" >> >> Additionally, the patch series reworks the npm fetcher to work without a >> npm binary and external package repository. It adds support for a common >> dependency name and version schema to integrate the dependencies into >> the SBOM. >> >> = Background >> Bitbake has diverse concepts and drawbacks for different tightly coupled >> package manager. The Python support uses a recipe per dependency and >> generates common fetcher URLs via a python function. The other languages >> embed the dependencies inside the recipe. The Node.js support offers a >> npmsw fetcher which uses a lock file beside the recipe to generates >> multiple common fetcher URLs on the fly and thereby hides the real >> download sources. This leads to a single source in the SBOM for example. >> The Go support contains two parallel implementations. 
A vendor-based >> solution with a common fetcher and a go-mod-based solution with a gomod >> fetcher. The vendor-based solution includes the individual dependencies >> into the SRC_URI of the recipe and uses a python function to generate >> common fetcher URLs which additional information for the vendor task.The >> gomod fetcher uses a proprietary gomod URL. It translates the URL into a >> common URL and prepares meta data during unpack. The Rust support >> includes the individual dependencies in the SRC_URI of the recipe and >> uses proprietary crate URLs. The crate fetcher translates a proprietary >> URL into a common fetcher URL and prepares meta data during unpack. The >> recipetool does not support the crate and the gomod fetcher. This leads >> to missing licenses of the dependencies in the recipe for example >> librsvg. >> >> The steps needed to fetch dependencies for Node.js, Go and Rust are >> similar: >> 1. Extract the dependencies from a specification file (name, version, >> checksum and URL) >> 2. Generate proprietary fetcher URIs >> a. npm://registry.npmjs.org/;package=glob;version= 10.3.15 >> b. gomod://golang.org/x/net;version=v0.9.0 >> gomodgit://golang.org/x/net;version=v0.9.0;repo=go.googlesource.com/net >> c. crate://crates.io/glob/0.3.1 >> 3. Generate wget or git fetcher URIs >> a.https://registry.npmjs.org/glob/-/glob-10.3.15.tgz;downloadfilename=… >> b.https://proxy.golang.org/golang.org/x/net/@v/v0.9.0.zip;downloadfilename=… >> git://go.googlesource.com/net;protocol=https; subdir=… >> c.https://crates.io/api/v1/crates/glob/0.3.1/download;downloadfilename=… >> 4. Unpack >> 5. Create meta files >> a. Update lockfile and create tar.gz archives >> b. Create go.mod file >> Create info, go.mod file and zip archives >> c. Create .cargo-checksum.json files >> >> It looks like the recipetool is not widely used and therefore this patch >> series integrates the dependency resolving into the fetcher. 
After an >> agreement on a concept the fetcher could be extended. The fetcher could >> download the license information per package and a new build task could >> run the license cruncher from the recipetool. > I've spent a bit more time thinking about this and looking at the code > and I've mixed feelings on it.I can certainly see why you've > implemented it this way and it does have a lot of potential but there > are also potential risks. Thank you very much for your feedback. > My comments (on various elements): > > With a npm-shrinkwrap.json/package-lock.json/go.sum file, are > dependencies always recorded as specific entities with checksums? Yes, every dependency contains a fixed version and a checksum. The purpose of the file is integrity and reproducibility. > I'm a > little bit worried about how easily you could sneak a "floating" > version into this and make the fetcher non-deterministic. Does (or > could?) the code detect and error on that? We could raise an error if a checksum is missing in the dependency specification file, or make the checksum mandatory for the dependency fetcher. Furthermore, we could inspect the dependency URLs to detect a misuse of the file, like a "latest" string for the version. > Put another way, could one of these SRC_URIs map to multiple different > combinations of underlying component versions? If you mean the extracted SRC_URI for a single dependency from the dependency specification file (ex. npm-shrinkwrap.json), it could use special URLs to map to the latest version. But this is a misuse of the dependency specification file and could be detected. The tools always generate files with fixed versions, because a floating version with a fixed checksum makes no sense. > Our existing method effectively hardcodes/expands the lock file into > extended SRC_URI entries which makes the specific versions and > components really clear. 
> This change abstracts that away into the fetcher and makes it opaque to the user, and much harder for code like the archiver/license/spdx code to find/handle.

Really? Let's use the crate fetcher as an example. At the moment the cargo-update-recipe-crates class extracts the URI and checksum from the Cargo.lock. The class ignores the licenses and this leads to missing licenses in the recipe. The spdx files contain only bitbake-specific fetcher URLs, which are unknown outside of bitbake.

I also thought it would make sense to generate recipes from the dependency specification files and therefore previously worked on the recipetool. But it looks like the tool isn't really used and I'm afraid nobody will use the recipe to fix dependencies. In most cases it is easy to update a dependency in the native tooling and only provide an updated dependency specification file.

I have a WIP to integrate the dependencies into the spdx. This uses the expanded_urldata / implicit_urldata function to add the dependencies to the process list of the archiver and spdx.

https://github.com/weidmueller/poky/tree/feature/dependency-fetcher

Regarding the license, we could migrate the functionality from recipetool into a class and detect the licenses at build time. Theoretically the fetcher could fetch the license from the package manager repository, but we have to trust the repository because we have no checksum to detect changes. Maybe we could integrate tools like Syft or ScanCode to detect the licenses at build time. At the moment the best solution is to make sure that the SBOM contains the name and version of the dependencies and let other tools handle the license via the SBOM for now. Therefore I propose a common scheme to define the dependency name (dn) and version (dv) in the SRC_URI.

> I noticed that any fetcher operation has to first expand the lock file using a temporary directory.

I followed gitsm and am open to suggestions. The expand happens only once per fetcher object.
The sub fetcher object is saved in the proxy variable.

> You're using DL_DIR for that which I suspect isn't a great idea for tmp files.

This was taken over from gitsm.

> In many cases that will work fine but it is a bit of a performance overhead.
>
> I did start wondering if we should cache the lock files in a subdir of DL_DIR to help performance and also give some extra assurance about changing content.

This would be possible. I assume the best would be another sub SRC_URI to avoid code duplication for the locking and change detection.

> The url scheme is clever but also has a potential risk in that you can't really pass parameters to both the top level fetcher and the underlying one. I'm worried that is going to bite us further down the line.

At the moment I don't see a real problem, but maybe you are right. The existing language-specific fetchers use fixed paths for their downloads.

What do you propose? Should the fetcher skip the unpack of the source or should we introduce a sub fetcher which uses the download from another SRC_URI entry? The two entries could be linked via the name parameter. This approach could be combined with your suggestion above. The new fetcher would unpack a lock file from another (default) download.

>> = Open questions
>>
>> * Where should we download dependencies?
>> ** Should we use a folder per fetcher (ex. git and npm)?
>> ** Should we use the main folder (ex. crate)?
>> ** Should we translate the name into a folder (ex. gomod)?
>> ** Should we integrate the name into the filename (ex. git)?

> DL_DIR is meant to be a complete cache of the source so it would need to be downloaded there. Given it maps to the other fetchers, the existing cache mechanisms likely work for these just fine, the open question is on whether the lock/spec files should be cached after extraction.

You misunderstood the question. It's about the downloadfilename parameter.
At the moment some fetchers use a sub folder inside DL_DIR and others use the main folder. It looks like every fetcher has its own concept to handle file collisions between different fetchers. The git and npm fetchers use their own folders, the crate fetcher uses its own .crate file prefix, the gomod fetcher translates the URL into multiple folders and the git fetcher translates the URL into a single folder name.

>> * Where should we unpack the dependencies?
>> ** Should we use a folder inside the parent folder (ex. node_modules)?
>> ** Should we use a fixed folder inside unpackdir (ex. go/pkg/mod/cache/download and cargo_home/bitbake)?

> This likely depends on the fetcher as the different mechanisms will have different expectations about how they should be extracted (as npm/etc. would).

It depends on the fetcher, but the fetchers could use the same approach. At the moment every fetcher uses a different approach. The crate fetcher uses a fixed value. The gomod fetcher uses a variable (GO_MOD_CACHE_DIR) and the npm fetcher uses a parameter (destsuffix). Furthermore, the gomod fetcher overrides the common subdir parameter.

>> * How should we treat archives for package manager caches?
>> ** Should we unpack the archives to support patching (ex. npm)?
>> ** Should we copy the packed archive to avoid unpacking and packaging (ex. gomod)?

> If there are archives left after do_unpack, which task is going to unpack those? Are we expecting the build process in configure/compile to decompress them? Would those management tools accept things if they were extracted earlier? "unpack" would be the correct time to do it but I can see this getting into conflict with the package manager :/.

Most package managers expect archives. In the npm case the archive is unpacked by the fetcher and repacked by the npm.bbclass to support patching. The gomod fetcher doesn't unpack the downloaded archive and the gomodgit fetcher creates archives from git folders during unpack.
It would be possible to always keep the archives, or to always extract the archives and recreate them during the build. It is a trade-off between performance and patchability.

At the moment it is complicated to work with the different fetchers because every fetcher uses a different concept and it is unclear what the desired approach is.

>> This patch series depends on patch series 20241209103158.20833-1-stefan.herbrechtsmeier-oss@weidmueller.com ("[1/4] tests: fetch: adapt npmsw tests to fixed unpack behavior").

> Those merged thanks.

Thanks.

> I did wonder if patches 1-5 of this series could be merged separately too as they look reasonable regardless of the rest of the series?

Sure. Should I resend the patches as a separate series?

Regards
Stefan
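The floating-version check discussed in the exchange above (erroring out when a lock-file entry has no checksum or a non-pinned version) could look roughly like the following sketch. All names and the regex are hypothetical illustrations, not code from the patch series:

```python
import re

# Hypothetical sketch: reject lock-file entries that are not fully pinned,
# as discussed in the thread. Not the series' actual code.
FLOATING = re.compile(r"^(latest$|\*|[~^<>=])")

def check_locked_dependency(name, version, checksum):
    """Raise if a dependency entry lacks a checksum or uses a floating version."""
    if not checksum:
        raise ValueError(f"{name}: missing checksum in lock file")
    if not version or FLOATING.match(version):
        raise ValueError(f"{name}: floating version '{version}' is not allowed")

check_locked_dependency("glob", "10.3.15", "sha512-...")  # pinned entry passes
```

A check of this shape could live in the dependency mixin so that every spec-file-based fetcher (npmsw, go.sum, Cargo.lock) shares the same determinism guarantee.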
On 25.12.2024 at 16:17, Alexander Kanavin wrote:
> On Mon, 23 Dec 2024 at 11:03, Richard Purdie via lists.openembedded.org <richard.purdie=linuxfoundation.org@lists.openembedded.org> wrote:
>> Would you be able to check if this work meets the criteria set out there and if not, what the differences are?
> I'd also add that this would benefit from a demonstration with one of the real go/rust recipes in oe-core: basically it would be good to push a branch of poky somewhere public, and provide instructions on how to see the new fetchers in action, and observe their benefits.

https://github.com/yoctoproject/poky/compare/master...weidmueller:poky:feature/dependency-fetcher

I have migrated the crate recipes to the new fetcher and improved the spdx 2.2 class to include the name and version of the crate dependencies. You have to inherit the create-spdx-2.2 class and build the librsvg recipe to test the new fetcher.
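The crate URL mapping that underlies this migration (step 2c to 3c in the cover letter) can be sketched as follows. This is illustrative only, a simplified stand-in for the crate fetcher's actual translation logic:

```python
from urllib.parse import urlparse

# Illustrative sketch: translate a proprietary crate:// URL into the plain
# https download URL plus a collision-free download file name, following
# the scheme listed in the cover letter. Not the fetcher's actual code.
def crate_to_https(url):
    parsed = urlparse(url)
    assert parsed.scheme == "crate"
    name, version = parsed.path.strip("/").split("/")
    download = f"https://{parsed.netloc}/api/v1/crates/{name}/{version}/download"
    filename = f"{name}-{version}.crate"
    return f"{download};downloadfilename={filename}"

print(crate_to_https("crate://crates.io/glob/0.3.1"))
```

The `.crate` suffix in the download file name is what avoids collisions with other fetchers sharing the main DL_DIR folder, as discussed later in the thread.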
On Mon, 2025-01-06 at 15:35 +0100, Stefan Herbrechtsmeier wrote:
> On 06.01.2025 at 12:04, Richard Purdie wrote:
>
> > My comments (on various elements):
> >
> > With a npm-shrinkwrap.json/package-lock.json/go.sum file, are dependencies always recorded as specific entities with checksums?
>
> Yes, every dependency contains a fixed version and a checksum. The purpose of the file is integrity and reproducibility.

Thanks for confirming, that hasn't always been the case for some of these package management systems!

> > I'm a little bit worried about how easily you could sneak a "floating" version into this and make the fetcher non-deterministic. Does (or could?) the code detect and error on that?
>
> We could raise an error if a checksum is missing in the dependency specification file or make the checksum mandatory for the dependency fetcher. Furthermore, we could inspect the dependency URLs to detect a misuse of the file, such as a "latest" string for the version.

I think adding such an error would be a requirement for merging this.

> > Put another way, could one of these SRC_URIs map to multiple different combinations of underlying component versions?
>
> If you mean the extracted SRC_URI for a single dependency from the dependency specification file (ex. npm-shrinkwrap.json), it could use special URLs to map to the latest version. But this is a misuse of the dependency specification file and could be detected. The tools always generate files with fixed versions because a floating version with a fixed checksum makes no sense.

Even if it shouldn't happen, we need to detect and error for this case as it would become very problematic for us.

> > Our existing method effectively hardcodes/expands the lock file into extended SRC_URI entries which makes the specific versions and components really clear.
> > This change abstracts that away into the fetcher and makes it opaque to the user, and much harder for code like the archiver/license/spdx code to find/handle.
>
> Really? Let's use the crate fetcher as an example. At the moment the cargo-update-recipe-crates class extracts the URI and checksum from the Cargo.lock. The class ignores the licenses and this leads to missing licenses in the recipe. The spdx files contain only bitbake-specific fetcher URLs, which are unknown outside of bitbake.

I guess what I'm trying to say is that people generally easily understand the explicit expanded urls. Whilst that class does ignore license handling, the hope was that it would get added; it is certainly possible to fix that.

> I also thought it would make sense to generate recipes from the dependency specification files and therefore previously worked on the recipetool. But it looks like the tool isn't really used and I'm afraid nobody will use the recipe to fix dependencies. In most cases it is easy to update a dependency in the native tooling and only provide an updated dependency specification file.

I think people have wanted a single simple command to translate the specification file into our recipe format to update the recipe. For various reasons people didn't seem to find the recipetool approach was working and created the task workflow based one. There are pros and cons to both and I don't have a strong preference. I would like to see something which makes it clear to users what is going on though and is simple to use.

People do intuitively understand a .inc file with a list of urls in it. There are challenges in updating it.

This other approach is not as intuitive as everything is abstracted out of sight.

One thing for example which worries me is how are the license fields in the recipe going to be updated?

Currently, if we teach the class, it can set LICENSE variables appropriately.
With the new approach, you don't know the licenses until after unpack has run. Yes it can write it into the SPDX, but it won't work for something like the layer index or forms of analysis which don't build things.

This does also extend to vulnerability analysis since we can't know what is in a given recipe without actually unpacking it. For example we could know crate XXX at version YYY has a CVE but we can't tell if a recipe uses that crate until after do_unpack, or at least not without expandurl.

> I have a WIP to integrate the dependencies into the spdx. This uses the expanded_urldata / implicit_urldata function to add the dependencies to the process list of archiver and spdx.
>
> https://github.com/weidmueller/poky/tree/feature/dependency-fetcher
>
> Regarding the license, we could migrate the functionality from recipetool into a class and detect the licenses at build time. Theoretically the fetcher could fetch the license from the package manager repository, but we have to trust the repository because we have no checksum to detect changes. Maybe we could integrate tools like Syft or ScanCode to detect the licenses at build time. At the moment the best solution is to make sure that the SBOM contains the name and version of the dependencies and let other tools handle the license via the SBOM for now. Therefore I propose a common scheme to define the dependency name (dn) and version (dv) in the SRC_URI.

We could compare what licenses the package manager is showing us with what is in the recipe and error if different. There would then need to be a command to update the licenses in the recipe (in much the way urls currently get updated).

> > I noticed that any fetcher operation has to first expand the lock file using a temporary directory.
>
> I followed gitsm and am open to suggestions. The expand happens only once per fetcher object. The sub fetcher object is saved in the proxy variable.
That fetcher object has to be recreated in every task or task context using the fetcher.

> > You're using DL_DIR for that which I suspect isn't a great idea for tmp files.
>
> This was taken over from gitsm.

Probably not the best fetcher and I'd say gitsm should be fixed.

> > In many cases that will work fine but it is a bit of a performance overhead.
> >
> > I did start wondering if we should cache the lock files in a subdir of DL_DIR to help performance and also give some extra assurance about changing content.
>
> This would be possible. I assume the best would be another sub SRC_URI to avoid code duplication for the locking and change detection.

Probably, I did wonder if the mixin could cover that abstraction/caching.

> > The url scheme is clever but also has a potential risk in that you can't really pass parameters to both the top level fetcher and the underlying one. I'm worried that is going to bite us further down the line.
>
> At the moment I don't see a real problem, but maybe you are right. The existing language-specific fetchers use fixed paths for their downloads.
>
> What do you propose? Should the fetcher skip the unpack of the source or should we introduce a sub fetcher which uses the download from another SRC_URI entry? The two entries could be linked via the name parameter. This approach could be combined with your suggestion above. The new fetcher would unpack a lock file from another (default) download.

I'm not really sure what is best right now. I'm trying to spell out the pros/cons of what is going on here in the hope it encourages others to give feedback as well. I agree there isn't a problem right now but I worry there soon will be by mixing two things together like this. The way we handle the git protocol does cause us friction with other url schemes already.

> > > = Open questions
> > >
> > > * Where should we download dependencies?
> > > ** Should we use a folder per fetcher (ex. git and npm)?
> > > ** Should we use the main folder (ex. crate)?
> > > ** Should we translate the name into a folder (ex. gomod)?
> > > ** Should we integrate the name into the filename (ex. git)?
> >
> > DL_DIR is meant to be a complete cache of the source so it would need to be downloaded there. Given it maps to the other fetchers, the existing cache mechanisms likely work for these just fine, the open question is on whether the lock/spec files should be cached after extraction.
>
> You misunderstood the question. It's about the downloadfilename parameter. At the moment some fetchers use a sub folder inside DL_DIR and others use the main folder. It looks like every fetcher has its own concept to handle file collisions between different fetchers. The git and npm fetchers use their own folders, the crate fetcher uses its own .crate file prefix, the gomod fetcher translates the URL into multiple folders and the git fetcher translates the URL into a single folder name.

That makes more sense. The layout is partially legacy. The wget and local fetchers were first and hence go directly into DL_DIR. git/svn were separated out into their own directories with a plan to have a directory per fetcher. That didn't always work out with each newer fetcher. Each fetcher does have to handle a unique naming of its urls as only the specific fetcher can know all the url parameters and which ones affect the output vs which ones don't.

> > > * Where should we unpack the dependencies?
> > > ** Should we use a folder inside the parent folder (ex. node_modules)?
> > > ** Should we use a fixed folder inside unpackdir (ex. go/pkg/mod/cache/download and cargo_home/bitbake)?
> >
> > This likely depends on the fetcher as the different mechanisms will have different expectations about how they should be extracted (as npm/etc. would).
> It depends on the fetcher, but the fetchers could use the same approach. At the moment every fetcher uses a different approach. The crate fetcher uses a fixed value. The gomod fetcher uses a variable (GO_MOD_CACHE_DIR) and the npm fetcher uses a parameter (destsuffix). Furthermore, the gomod fetcher overrides the common subdir parameter.

I think we really need to standardise that if we can. Each new fetcher has claimed a certain approach is effectively required by the package manager.

> > > * How should we treat archives for package manager caches?
> > > ** Should we unpack the archives to support patching (ex. npm)?
> > > ** Should we copy the packed archive to avoid unpacking and packaging (ex. gomod)?
> >
> > If there are archives left after do_unpack, which task is going to unpack those? Are we expecting the build process in configure/compile to decompress them? Would those management tools accept things if they were extracted earlier? "unpack" would be the correct time to do it but I can see this getting into conflict with the package manager :/.
>
> Most package managers expect archives. In the npm case the archive is unpacked by the fetcher and repacked by the npm.bbclass to support patching. The gomod fetcher doesn't unpack the downloaded archive and the gomodgit fetcher creates archives from git folders during unpack. It would be possible to always keep the archives or always extract the archives and recreate archives during build. It is a decision between performance and patchability.
>
> At the moment it is complicated to work with the different fetchers because every fetcher uses a different concept and it is unclear what the desired approach is.

This is a challenge. Can we handle the unpacking with the package manager as a specific step or does it have to be combined with other steps like configure/compile?
> > I did wonder if patches 1-5 of this series could be merged separately too as they look reasonable regardless of the rest of the series?
>
> Sure. Should I resend the patches as a separate series?

Yes please, that would then let us remove the bits we can easily review/sort and focus on this other part.

Cheers,
Richard
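The two-stage flow that this exchange keeps circling back to (stage 1: obtain the specification file; stage 2: resolve it into plain fetcher URLs) can be sketched in a few lines. Class and method names are illustrative stand-ins, not the series' actual mixin code, and the spec content is stubbed:

```python
# Rough, self-contained sketch of the two-stage dependency mixin described
# in the cover letter. All names are hypothetical, not the series' code.
class DependencyMixin:
    def fetch_spec(self, src_uri):
        # Stage 1: locate a local spec file or fetch the archive/git repo
        # containing it (stubbed here with a single npm-style entry).
        return {"glob": "10.3.15"}

    def resolve(self, spec):
        # Stage 2: map each pinned entry onto an ordinary download URL
        # that the plain wget fetcher can handle.
        return [f"https://registry.npmjs.org/{n}/-/{n}-{v}.tgz"
                for n, v in spec.items()]

    def download(self, src_uri):
        return self.resolve(self.fetch_spec(src_uri))

print(DependencyMixin().download("npmsw://npm-shrinkwrap.json"))
```

The point of the abstraction is that npmsw, go.sum and Cargo.lock support only differ in stage 2's URL mapping; the locking, caching and change-detection concerns Richard raises would live once in the shared mixin.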
On 06.01.2025 at 16:30, Richard Purdie wrote:
> On Mon, 2025-01-06 at 15:35 +0100, Stefan Herbrechtsmeier wrote:
>> On 06.01.2025 at 12:04, Richard Purdie wrote:
>>
>>> My comments (on various elements):
>>>
>>> With a npm-shrinkwrap.json/package-lock.json/go.sum file, are dependencies always recorded as specific entities with checksums?
>>>
>> Yes, every dependency contains a fixed version and a checksum. The purpose of the file is integrity and reproducibility.
> Thanks for confirming, that hasn't always been the case for some of these package management systems!
>
>>> I'm a little bit worried about how easily you could sneak a "floating" version into this and make the fetcher non-deterministic. Does (or could?) the code detect and error on that?
>> We could raise an error if a checksum is missing in the dependency specification file or make the checksum mandatory for the dependency fetcher. Furthermore, we could inspect the dependency URLs to detect a misuse of the file, such as a "latest" string for the version.
>
> I think adding such an error would be a requirement for merging this.

Should the dependency fetcher (ex. npmsw) or the language-specific fetcher (ex. npm) fail if the version points to a latest version?

>>> Put another way, could one of these SRC_URIs map to multiple different combinations of underlying component versions?
>> If you mean the extracted SRC_URI for a single dependency from the dependency specification file (ex. npm-shrinkwrap.json), it could use special URLs to map to the latest version. But this is a misuse of the dependency specification file and could be detected. The tools always generate files with fixed versions because a floating version with a fixed checksum makes no sense.
> Even if it shouldn't happen, we need to detect and error for this case as it would become very problematic for us.

Okay.
Should we disallow a dynamic version for package manager downloads generally or do you see a reasonable use case?

>>> Our existing method effectively hardcodes/expands the lock file into extended SRC_URI entries which makes the specific versions and components really clear. This change abstracts that away into the fetcher and makes it opaque to the user, and much harder for code like the archiver/license/spdx code to find/handle.
>> Really? Let's use the crate fetcher as an example. At the moment the cargo-update-recipe-crates class extracts the URI and checksum from the Cargo.lock. The class ignores the licenses and this leads to missing licenses in the recipe. The spdx files contain only bitbake-specific fetcher URLs, which are unknown outside of bitbake.
> I guess what I'm trying to say is that people generally easily understand the explicit expanded urls. Whilst that class does ignore license handling, the hope was that it would get added; it is certainly possible to fix that.

I will check how complicated it is to extract the license information from the package registries.

>> I also thought it would make sense to generate recipes from the dependency specification files and therefore previously worked on the recipetool. But it looks like the tool isn't really used and I'm afraid nobody will use the recipe to fix dependencies. In most cases it is easy to update a dependency in the native tooling and only provide an updated dependency specification file.
> I think people have wanted a single simple command to translate the specification file into our recipe format to update the recipe. For various reasons people didn't seem to find the recipetool approach was working and created the task workflow based one. There are pros and cons to both and I don't have a strong preference. I would like to see something which makes it clear to users what is going on though and is simple to use.
> People do intuitively understand a .inc file with a list of urls in it. There are challenges in updating it.
>
> This other approach is not as intuitive as everything is abstracted out of sight.
>
> One thing for example which worries me is how are the license fields in the recipe going to be updated?
>
> Currently, if we teach the class, it can set LICENSE variables appropriately. With the new approach, you don't know the licenses until after unpack has run. Yes it can write it into the SPDX, but it won't work for something like the layer index or forms of analysis which don't build things.
>
> This does also extend to vulnerability analysis since we can't know what is in a given recipe without actually unpacking it. For example we could know crate XXX at version YYY has a CVE but we can't tell if a recipe uses that crate until after do_unpack, or at least not without expandurl.

The main question is whether the metadata should contain all information. If yes, we shouldn't allow any fetcher which requires an external source. This should include the gitsm fetcher, and we should replace the single SRC_URI with multiple git SRC_URIs. We can go even further and forbid specific package manager fetchers and use plain https or git SRC_URIs. The python and go-vendor fetchers use this approach. Alternatively, we could allow dependency fetchers and require that the metadata always be used via bitbake. In this case we could extend the metadata via the fetcher.

In both cases it is possible to produce the same metadata. It doesn't matter if we use recipetool, devtool, bbclasses or fetchers. In any case we could resolve the SRC_URIs, checksums or srcrev from a file. The license information could be fetched from the package repositories without integrity checks or could be extracted from the individual package description file inside the downloaded sources (ex. npm).
We should skip the license detection from license files for now because it generates manual work and could be discussed later.

The recipe approach has the advantage that it uses fixed licenses and that license changes could (theoretically) be reviewed during a recipe update. In contrast, the fetcher approach reduces the update procedure to a simple file rename or SRCREV update (ex. gitsm). Furthermore, the user could simply place a file beside the recipe to update the dependencies. Could we realize the same via devtool integration and a patch?

We have different solutions between the languages (ex. npmsw vs crate vs pypi) and even inside the languages (ex. go-vendor vs gomod). I would like to unify the dependency support. It doesn't matter if we decide to use the bitbake fetcher or a bitbake / devtool command for the dependency and license resolution.

>> I have a WIP to integrate the dependencies into the spdx. This uses the expanded_urldata / implicit_urldata function to add the dependencies to the process list of archiver and spdx.
>>
>> https://github.com/weidmueller/poky/tree/feature/dependency-fetcher
>>
>> Regarding the license, we could migrate the functionality from recipetool into a class and detect the licenses at build time. Theoretically the fetcher could fetch the license from the package manager repository, but we have to trust the repository because we have no checksum to detect changes. Maybe we could integrate tools like Syft or ScanCode to detect the licenses at build time. At the moment the best solution is to make sure that the SBOM contains the name and version of the dependencies and let other tools handle the license via the SBOM for now. Therefore I propose a common scheme to define the dependency name (dn) and version (dv) in the SRC_URI.
> We could compare what licenses the package manager is showing us with what is in the recipe and error if different.
> There would then need to be a command to update the licenses in the recipe (in much the way urls currently get updated).

Either we request the licenses from the package manager during the package update or during fetch. I wouldn't do both. Instead I would analyze the license file during the build and compare the detected license with the recipe or fetcher generated licenses. But the license detection from files is another topic and I would like to postpone it for now.

>>> I noticed that any fetcher operation has to first expand the lock file using a temporary directory.
>>>
>> I followed gitsm and am open to suggestions. The expand happens only once per fetcher object. The sub fetcher object is saved in the proxy variable.
> That fetcher object has to be recreated in every task or task context using the fetcher.

Okay. In this case it makes sense to cache the resolved URIs.

>>> You're using DL_DIR for that which I suspect isn't a great idea for tmp files.
>>>
>> This was taken over from gitsm.
> Probably not the best fetcher and I'd say gitsm should be fixed.

I don't see a reason why the gitsm fetcher shouldn't be handled like the other dependency fetchers. We could update the handler after we have a decision for the dependency fetchers.

>>> In many cases that will work fine but it is a bit of a performance overhead.
>>>
>>> I did start wondering if we should cache the lock files in a subdir of DL_DIR to help performance and also give some extra assurance about changing content.
>>>
>> This would be possible. I assume the best would be another sub SRC_URI to avoid code duplication for the locking and change detection.
> Probably, I did wonder if the mixin could cover that abstraction/caching.

That should be possible.

>>> The url scheme is clever but also has a potential risk in that you can't really pass parameters to both the top level fetcher and the underlying one.
>>> I'm worried that is going to bite us further down the line.
>>>
>> At the moment I don't see a real problem, but maybe you are right. The existing language-specific fetchers use fixed paths for their downloads.
>>
>> What do you propose? Should the fetcher skip the unpack of the source or should we introduce a sub fetcher which uses the download from another SRC_URI entry? The two entries could be linked via the name parameter. This approach could be combined with your suggestion above. The new fetcher would unpack a lock file from another (default) download.
>>
> I'm not really sure what is best right now. I'm trying to spell out the pros/cons of what is going on here in the hope it encourages others to give feedback as well. I agree there isn't a problem right now but I worry there soon will be by mixing two things together like this. The way we handle the git protocol does cause us friction with other url schemes already.

The dependency fetcher could simply skip the unpack. In this case the user needs to use a variable to pass the same URL to the git and dependency fetchers, or we could provide a python function to generate two SRC_URIs with the same base URL.

>>>> = Open questions
>>>>
>>>> * Where should we download dependencies?
>>>> ** Should we use a folder per fetcher (ex. git and npm)?
>>>> ** Should we use the main folder (ex. crate)?
>>>> ** Should we translate the name into a folder (ex. gomod)?
>>>> ** Should we integrate the name into the filename (ex. git)?
>>>>
>>> DL_DIR is meant to be a complete cache of the source so it would need to be downloaded there. Given it maps to the other fetchers, the existing cache mechanisms likely work for these just fine, the open question is on whether the lock/spec files should be cached after extraction.
>>>
>> You misunderstood the question. It's about the downloadfilename parameter.
>> At the moment some fetchers use a sub folder inside DL_DIR and others use the main folder. It looks like every fetcher has its own concept to handle file collisions between different fetchers. The git and npm fetchers use their own folders, the crate fetcher uses its own .crate file prefix, the gomod fetcher translates the URL into multiple folders and the git fetcher translates the URL into a single folder name.
> That makes more sense. The layout is partially legacy. The wget and local fetchers were first and hence go directly into DL_DIR. git/svn were separated out into their own directories with a plan to have a directory per fetcher. That didn't always work out with each newer fetcher. Each fetcher does have to handle a unique naming of its urls as only the specific fetcher can know all the url parameters and which ones affect the output vs which ones don't.

This doesn't explain why the npm but not the gomod and crate fetcher uses a sub folder. All fetchers are based on the wget fetcher.

>>>> * Where should we unpack the dependencies?
>>>> ** Should we use a folder inside the parent folder (ex. node_modules)?
>>>> ** Should we use a fixed folder inside unpackdir (ex. go/pkg/mod/cache/download and cargo_home/bitbake)?
>>>>
>>> This likely depends on the fetcher as the different mechanisms will have different expectations about how they should be extracted (as npm/etc. would).
>>>
>> It depends on the fetcher, but the fetchers could use the same approach. At the moment every fetcher uses a different approach. The crate fetcher uses a fixed value. The gomod fetcher uses a variable (GO_MOD_CACHE_DIR) and the npm fetcher uses a parameter (destsuffix). Furthermore, the gomod fetcher overrides the common subdir parameter.
> I think we really need to standardise that if we can. Each new fetcher has claimed a certain approach is effectively required by the package manager.

What would be your desired solution?
Is the variable okay or do you prefer a self-contained SRC_URI?

>>>> * How should we treat archives for package manager caches?
>>>> ** Should we unpack the archives to support patching (ex. npm)?
>>>> ** Should we copy the packed archive to avoid unpacking and packaging (ex. gomod)?
>>>
>>> If there are archives left after do_unpack, which task is going to unpack those? Are we expecting the build process in configure/compile to decompress them? Would those management tools accept things if they were extracted earlier? "unpack" would be the correct time to do it but I can see this getting into conflict with the package manager :/.
>>
>> Most package managers expect archives. In the npm case the archive is unpacked by the fetcher and repacked by the npm.bbclass to support patching. The gomod fetcher doesn't unpack the downloaded archive and the gomodgit fetcher creates archives from git folders during unpack. It would be possible to always keep the archives, or to always extract the archives and recreate them during build. It is a decision between performance and patchability.
>>
>> At the moment it is complicated to work with the different fetchers because every fetcher uses a different concept and it is unclear what the desired approach is.

> This is a challenge. Can we handle the unpacking with the package manager as a specific step or does it have to be combined with other steps like configure/compile?

It looks like this is possible:

cargo fetch
go mod vendor
npm install

I suspect you're thinking about using the package manager in do_unpack to unpack the archives and patch the unpacked archives afterwards?

>>> I did wonder if patches 1-5 of this series could be merged separately too as they look reasonable regardless of the rest of the series?
>>
>> Sure. Should I resend the patches as separate series?
> Yes please, that would then let us remove the bits we can easily review/sort and focus on this other part.

Done.

I will also resend the go h1 checksum commit separately because it could be useful for the gomod fetcher.

Should I also move the dn / dv parameter patches to a separate series, because they could be useful without the dependency fetcher? I could add the parameters to the fetchers in a backward-compatible way.

Regards
Stefan
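To make the two-SRC_URI idea above concrete, a helper along these lines could derive both entries from one base URL. This is only a sketch: the function name, the `srcname` parameter and the reuse of the `npmsw` scheme are assumptions for illustration, not existing BitBake API.

```python
# Hypothetical helper: derive a source SRC_URI entry and a matching
# dependency-fetcher entry from a single base URL, linked via the name
# parameter so the dependency fetcher can reuse the source download
# instead of fetching the repository a second time.

def src_uri_pair(base_url, lockfile, dep_scheme, name="source"):
    """Return [source entry, dependency entry] sharing one base URL."""
    source = "git://%s;protocol=https;name=%s" % (base_url, name)
    deps = "%s://%s;srcname=%s" % (dep_scheme, lockfile, name)
    return [source, deps]

entries = src_uri_pair("example.com/app.git", "npm-shrinkwrap.json", "npmsw")
print(" ".join(entries))
```

A recipe could then build its SRC_URI from the returned list, so the source checkout and the lock-file resolution always stay in sync.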
On Tue, 2025-01-07 at 10:47 +0100, Stefan Herbrechtsmeier wrote:
> Am 06.01.2025 um 16:30 schrieb Richard Purdie:
> > On Mon, 2025-01-06 at 15:35 +0100, Stefan Herbrechtsmeier wrote:
> > > > I'm a little bit worried about how easily you could sneak a "floating" version into this and make the fetcher non-deterministic. Does (or could?) the code detect and error on that?
> > >
> > > We could raise an error if a checksum is missing in the dependency specification file or make the checksum mandatory for the dependency fetcher. Furthermore we could inspect the dependency URLs to detect a misuse of the file, like a latest string for the version.
> >
> > I think adding such an error would be a requirement for merging this.
>
> Should the dependency fetcher (ex. npmsw) or the language-specific fetcher (ex. npm) fail if the version points to a latest version?

I think right now it has to error to try and reduce complexity. It is possible to support such things but you have to pass that version information back up the stack so that PV represents the different versions, and that is a new level of complexity.

I guess we should consider how you could theoretically support it as that might influence the design. With multiple git repos in SRC_URI for example, we end up adding multiple shortened shas to construct a PV so that if any of them change, PV changes. We also have to add an incrementing integer so that opkg/dpkg/rpm operations work and versions sort.

> > > > Put another way, could one of these SRC_URIs map to multiple different combinations of underlying component versions?
> > >
> > > If you mean the extracted SRC_URI for a single dependency from the dependency specification file (ex. npm-shrinkwrap.json), it could use special URLs to map to the latest version. But this is a misuse of the dependency specification file and could be detected.
> > > The tools always generate files with fixed versions because a floating version with a fixed checksum makes no sense.
> >
> > Even if it shouldn't happen, we need to detect and error for this case as it would become very problematic for us.
>
> Okay. Should we disallow a dynamic version for package manager downloads generally, or do you see a reasonable use case?

See above.

> > > I also thought it would make sense to generate recipes from the dependency specification files and therefore worked on the recipetool previously. But it looks like the tool isn't really used and I'm afraid nobody will use the recipe to fix dependencies. In most cases it is easy to update a dependency in the native tooling and only provide an updated dependency specification file.
> >
> > I think people have wanted a single simple command to translate the specification file into our recipe format to update the recipe. For various reasons people didn't seem to find the recipetool approach was working and created the task workflow based one. There are pros and cons to both and I don't have a strong preference. I would like to see something which makes it clear to users what is going on though and is simple to use.
> >
> > People do intuitively understand a .inc file with a list of urls in it. There are challenges in updating it.
> >
> > This other approach is not as intuitive as everything is abstracted out of sight.
> >
> > One thing for example which worries me is how are the license fields in the recipe going to be updated?
> >
> > Currently, if we teach the class, it can set LICENSE variables appropriately. With the new approach, you don't know the licenses until after unpack has run. Yes it can write it into the SPDX, but it won't work for something like the layer index or forms of analysis which don't build things.
> >
> > This does also extend to vulnerability analysis since we can't know what is in a given recipe without actually unpacking it. For example we could know crate XXX at version YYY has a CVE but we can't tell if a recipe uses that crate until after do_unpack, or at least not without expandurl.
>
> The main question is if the metadata should contain all information. If yes, we shouldn't allow any fetcher which requires an external source. This should include the gitsm fetcher and we should replace the single SRC_URI with multiple git SRC_URIs.

If we had tooling that supported that well we could certainly consider it. It isn't straightforward as you can have a git repo containing submodules which then themselves contain submodules which can then contain more levels of submodules. There are therefore multiple levels of expansion possible.

> We can go even further and forbid specific package manager fetchers and use plain https or git SRC_URIs. The python and go-vendor fetchers use this approach.
>
> Alternatively we allow dependency fetchers and require that the metadata always be used via bitbake. In this case we could extend the metadata via the fetcher.
>
> In both cases it is possible to produce the same metadata. It doesn't matter if we use recipetool, devtool, bbclasses or fetchers. In any case we could resolve the SRC_URIs, checksums or srcrev from a file. The license information could be fetched from the package repositories without integrity checks or could be extracted from the individual package description file inside the downloaded sources (ex. npm). We should skip the license detection from license files for now because it generates manual work and could be discussed later.

That was the reason the current task based approach doesn't use them, yet! I mention it just to highlight that it can be solved either way; the approach doesn't really change what we need to do.
The bigger concern is having information available in the metadata, which I think we need to do to some level regardless of which approach we choose.

> The recipe approach has the advantage that it uses fixed licenses and that license changes could (theoretically) be reviewed during recipe update.

FWIW that is an important use case and one of our general strengths. We can only do that as the license information is written in recipes and can be compared at update time.

> In contrast the fetcher approach reduces the update procedure to a simple file rename or SRCREV update (ex. gitsm). Furthermore, the user could simply place a file beside the recipe to update the dependencies. Could we realize the same via devtool integration and a patch?

This is effectively what the task based approach is aiming for currently. I think the idea was that we could have devtool/recipetool integration around that update task; a task was just a convenient way to capture the code to do it and get things working without needing the tool to be finished.

> We have different solutions between the languages (ex. npmsw vs crate vs pypi) and even inside the languages (ex. go-vendor vs gomod). I would like to unify the dependency support. It doesn't matter if we decide to use the bitbake fetcher or a bitbake / devtool command for the dependency and license resolution.

I do very much prefer having one good way of doing things rather than multiple ways of doing things, each with a potential drawback. I'm therefore broadly in favour of doing that as long as we don't upset too much existing mindshare along the way.

> > > I have a WIP to integrate the dependencies into the SPDX. This uses the expanded_urldata / implicit_urldata function to add the dependencies to the process list of archiver and spdx.
> > >
> > > https://github.com/weidmueller/poky/tree/feature/dependency-fetcher
> > >
> > > Regarding the license we could migrate the functionality from recipetool into a class and detect the licenses at build time. Theoretically the fetcher could fetch the license from the package manager repository, but we have to trust the repository because we have no checksum to detect changes. Maybe we could integrate tools like Syft or ScanCode to detect the licenses at build time. At the moment the best solution is to make sure that the SBOM contains the name and version of the dependencies and let other tools handle the license via SBOM for now. Therefore I propose a common scheme to define the dependency name (dn) and version (dv) in the SRC_URI.
> >
> > We could compare what licenses the package manager is showing us with what is in the recipe and error if different. There would then need to be a command to update the licenses in the recipe (in much the way urls currently get updated).
>
> Either we request the licenses from the package manager during package update or during fetch. I wouldn't do both. Instead I would analyze the license file during build and compare the detected license with the recipe or fetcher generated licenses. But the license detection from files is another topic and I would like to postpone it for now.

Agreed, I mention it just to highlight that supporting them does have impact on the design, so any solution needs to ultimately be able to support it.

> > > > You're using DL_DIR for that which I suspect isn't a great idea for tmp files.
> > >
> > > Take over from gitsm.
> >
> > Probably not the best fetcher and I'd say gitsm should be fixed.
>
> I don't see a reason why the gitsm fetcher shouldn't be handled like the other dependency fetchers.
> We could update the handler after we have a decision for the dependency fetchers.

In principle perhaps, but as mentioned above, gitsm has its own challenges.

> > > > The url scheme is clever but also has a potential risk in that you can't really pass parameters to both the top level fetcher and the underlying one. I'm worried that is going to bite us further down the line.
> > >
> > > At the moment I don't see a real problem but maybe you are right. The existing language-specific fetchers use fixed paths for their downloads.
> > >
> > > What do you propose? Should the fetcher skip the unpack of the source, or should we introduce a sub-fetcher which uses the download from another SRC_URI entry? The two entries could be linked via the name parameter. This approach could be combined with your suggestion above. The new fetcher would unpack a lock file from another (default) download.
> >
> > I'm not really sure what is best right now. I'm trying to spell out the pros/cons of what is going on here in the hope it encourages others to give feedback as well. I agree there isn't a problem right now but I worry there soon will be by mixing two things together like this. The way we handle the git protocol does cause us friction with other URL schemes already.
>
> The dependency fetcher could simply skip the unpack. In this case the user needs to use a variable to pass the same URL to the git and dependency fetchers, or we could provide a python function to generate two SRC_URI entries with the same base URL.

I'm starting to wonder about a slightly different approach, basically an optional generated file alongside a recipe which contains "expanded" information which is effectively expensive to generate (in computation or resource like network access/process terms). We could teach bitbake a new phase of parsing where it generated them if missing.
There are some other pieces of information which we know during the build process which it would be helpful to know earlier (e.g. which packages a recipe generates). I've wondered about this for a long time and the fetcher issues remind me of it again. It would be a big change with advantages and drawbacks. I think it would put more pressure on a layer maintainer as they'd have to computationally keep this up to date and it would complicate the patch workflow (who should send/regen the files?). I'm putting the idea there, I'm not saying I think we should do it, I'm just considering options.

> > > > > = Open questions
> > > > >
> > > > > * Where should we download dependencies?
> > > > > ** Should we use a folder per fetcher (ex. git and npm)?
> > > > > ** Should we use the main folder (ex. crate)?
> > > > > ** Should we translate the name into a folder (ex. gomod)?
> > > > > ** Should we integrate the name into the filename (ex. git)?
> > > >
> > > > DL_DIR is meant to be a complete cache of the source so it would need to be downloaded there. Given it maps to the other fetchers, the existing cache mechanisms likely work for these just fine; the open question is whether the lock/spec files should be cached after extraction.
> > >
> > > You misunderstand the question. It's about the downloadfilename parameter. At the moment some fetchers use a subfolder inside DL_DIR and others use the main folder. It looks like every fetcher has its own concept to handle file collisions between different fetchers. The git and npm fetchers use their own folders, the crate fetcher uses its own .crate file prefix, the gomod fetcher translates the URL into multiple folders, and the git fetcher translates the URL into a single folder name.
> >
> > That makes more sense. The layout is partially legacy. The wget and local fetchers were first and hence go directly into DL_DIR.
> > git/svn were separated out into their own directories with a plan to have a directory per fetcher. That didn't always work out with each newer fetcher. Each fetcher does have to handle a unique naming of its URLs as only the specific fetcher can know all the URL parameters and which ones affect the output vs which ones don't.
>
> This doesn't explain why the npm fetcher, but not the gomod and crate fetchers, uses a subfolder. All fetchers are based on the wget fetcher.

That is probably "my fault". Put yourself in my position. You get a ton of different patches, all touching very varied aspects of the system. When reviewing them you have to try and remember the original design decisions, the future directions, the ways things broke in the past, a desire to try and have clean consistent APIs and so on. I have tried very hard to move things in a direction where things incrementally improve, without unnecessarily blocking new features. It means that things that merge often aren't perfect. We've tried a few different approaches with the newer programming languages and each approach has had pros and cons. The inconsistency is probably because I missed something in review. Sorry :(.

I only have finite time. There are few people who seem to want to dive in and help with review of patches like these. I did ask some people yesterday; one told me they simply couldn't understand these patches. I'm doing my best to ask the right questions, try and help others understand them, and ensure my own concerns I can identify are resolved. I don't want to de-motivate you on this work either; I think the idea of improving this is great and I'd love to see it. Equally, I'm also the first person everyone will complain to if we change something and it causes problems for people.

So the explanation is probably that I just missed something in review at some point. The intent was to separate out the fetcher output going forward (unless it makes sense to be shared).
FWIW there are multiple things which bother me about the existing fetcher storage layout but that is a different discussion.

> > > > > * Where should we unpack the dependencies?
> > > > > ** Should we use a folder inside the parent folder (ex. node_modules)?
> > > > > ** Should we use a fixed folder inside unpackdir (ex. go/pkg/mod/cache/download and cargo_home/bitbake)?
> > > >
> > > > This likely depends on the fetcher as the different mechanisms will have different expectations about how they should be extracted (as npm/etc. would).
> > >
> > > It depends on the fetcher, but the fetchers could use the same approach. At the moment every fetcher uses a different approach. The crate fetcher uses a fixed value. The gomod fetcher uses a variable (GO_MOD_CACHE_DIR) and the npm fetcher uses a parameter (destsuffix). Furthermore the gomod fetcher overrides the common subdir parameter.
> >
> > I think we really need to standardise that if we can. Each new fetcher has claimed a certain approach is effectively required by the package manager.
>
> What would be your desired solution? Is the variable okay or do you prefer a self-contained SRC_URI?

I suspect we need a default via a variable and then the option to change the default via parameters. The default value should be a bitbake fetcher namespaced control variable. I'm wary of making a definitive statement saying X if that isn't going to make sense for some backend though. I simply don't have enough knowledge of them all, which is why you see me being reluctant to make definitive statements about design.

> > > > > * How should we treat archives for package manager caches?
> > > > > ** Should we unpack the archives to support patching (ex. npm)?
> > > > > ** Should we copy the packed archive to avoid unpacking and packaging (ex. gomod)?
> > > >
> > > > If there are archives left after do_unpack, which task is going to unpack those? Are we expecting the build process in configure/compile to decompress them? Would those management tools accept things if they were extracted earlier? "unpack" would be the correct time to do it but I can see this getting into conflict with the package manager :/.
> > >
> > > Most package managers expect archives. In the npm case the archive is unpacked by the fetcher and repacked by the npm.bbclass to support patching. The gomod fetcher doesn't unpack the downloaded archive and the gomodgit fetcher creates archives from git folders during unpack. It would be possible to always keep the archives, or to always extract the archives and recreate them during build. It is a decision between performance and patchability.
> > >
> > > At the moment it is complicated to work with the different fetchers because every fetcher uses a different concept and it is unclear what the desired approach is.
> >
> > This is a challenge. Can we handle the unpacking with the package manager as a specific step or does it have to be combined with other steps like configure/compile?
>
> It looks like this is possible:
> cargo fetch
> go mod vendor
> npm install
>
> I suspect you're thinking about using the package manager in do_unpack to unpack the archives and patch the unpacked archives afterwards?

I'm wondering about it, yes. I know we've had challenges with patching rust modules for example, so this isn't a theoretical problem :/.

> > > > I did wonder if patches 1-5 of this series could be merged separately too as they look reasonable regardless of the rest of the series?
> > >
> > > Sure. Should I resend the patches as separate series?
> >
> > Yes please, that would then let us remove the bits we can easily review/sort and focus on this other part.
>
> Done.

Thanks.
> I will also resend the go h1 checksum commit separately because it could be useful for the gomod fetcher.

Yes, I was waiting for a new version of that one with the naming tweaked.

> Should I also move the dn / dv parameter patches to a separate series, because they could be useful without the dependency fetcher? I could add the parameters to the fetchers in a backward-compatible way.

I need to think more about that one...

Cheers,

Richard
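The PV construction Richard describes above — one shortened sha per git repository plus an incrementing serial so package-manager version comparisons still sort — can be illustrated with a small sketch. The helper name and the exact output format are made up for illustration; BitBake's real mechanism differs in detail.

```python
# Illustrative sketch only: build a PV string from multiple git revisions.
# One shortened sha per repository means any revision change alters PV,
# and the incrementing serial keeps opkg/dpkg/rpm version sorting sane
# across rebuilds where the shas alone would not compare monotonically.

def construct_pv(base_version, srcrevs, serial):
    parts = [base_version, str(serial)] + ["git" + rev[:10] for rev in srcrevs]
    return "+".join(parts)

pv = construct_pv("1.0", ["a1b2c3d4e5f6a7b8c9d0", "0f1e2d3c4b5a69788776"], 2)
print(pv)  # 1.0+2+gita1b2c3d4e5+git0f1e2d3c4b
```

The serial has to be bumped manually (or derived from build history) whenever the shas change, since a sha substring carries no ordering information on its own.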
Am 07.01.2025 um 12:01 schrieb Richard Purdie:
> On Tue, 2025-01-07 at 10:47 +0100, Stefan Herbrechtsmeier wrote:
>> Am 06.01.2025 um 16:30 schrieb Richard Purdie:
>>> On Mon, 2025-01-06 at 15:35 +0100, Stefan Herbrechtsmeier wrote:
>>>>> I'm a little bit worried about how easily you could sneak a "floating" version into this and make the fetcher non-deterministic. Does (or could?) the code detect and error on that?
>>>>>
>>>> We could raise an error if a checksum is missing in the dependency specification file or make the checksum mandatory for the dependency fetcher. Furthermore we could inspect the dependency URLs to detect a misuse of the file, like a latest string for the version.
>>>>
>>> I think adding such an error would be a requirement for merging this.
>>>
>> Should the dependency fetcher (ex. npmsw) or the language-specific fetcher (ex. npm) fail if the version points to a latest version?
> I think right now it has to error to try and reduce complexity. It is possible to support such things but you have to pass that version information back up the stack so that PV represents the different versions, and that is a new level of complexity.
>
> I guess we should consider how you could theoretically support it as that might influence the design. With multiple git repos in SRC_URI for example, we end up adding multiple shortened shas to construct a PV so that if any of them change, PV changes. We also have to add an incrementing integer so that opkg/dpkg/rpm operations work and versions sort.

Okay. In this case we should add the checks to the dependency resolution. Thereby we prohibit dynamic versions for the dependencies and allow users to add support for them in the fetcher of the package manager.

>>>>> Put another way, could one of these SRC_URIs map to multiple different combinations of underlying component versions?
>>>>
>>>> If you mean the extracted SRC_URI for a single dependency from the dependency specification file (ex. npm-shrinkwrap.json), it could use special URLs to map to the latest version. But this is a misuse of the dependency specification file and could be detected. The tools always generate files with fixed versions because a floating version with a fixed checksum makes no sense.
>>> Even if it shouldn't happen, we need to detect and error for this case as it would become very problematic for us.
>>>
>> Okay. Should we disallow a dynamic version for package manager downloads generally, or do you see a reasonable use case?
> See above.
>
>>>> I also thought it would make sense to generate recipes from the dependency specification files and therefore worked on the recipetool previously. But it looks like the tool isn't really used and I'm afraid nobody will use the recipe to fix dependencies. In most cases it is easy to update a dependency in the native tooling and only provide an updated dependency specification file.
>>>
>>> I think people have wanted a single simple command to translate the specification file into our recipe format to update the recipe. For various reasons people didn't seem to find the recipetool approach was working and created the task workflow based one. There are pros and cons to both and I don't have a strong preference. I would like to see something which makes it clear to users what is going on though and is simple to use.
>>>
>>> People do intuitively understand a .inc file with a list of urls in it. There are challenges in updating it.
>>>
>>> This other approach is not as intuitive as everything is abstracted out of sight.
>>>
>>> One thing for example which worries me is how are the license fields in the recipe going to be updated?
>>>
>>> Currently, if we teach the class, it can set LICENSE variables appropriately.
>>> With the new approach, you don't know the licenses until after unpack has run. Yes it can write it into the SPDX, but it won't work for something like the layer index or forms of analysis which don't build things.
>>>
>>> This does also extend to vulnerability analysis since we can't know what is in a given recipe without actually unpacking it. For example we could know crate XXX at version YYY has a CVE but we can't tell if a recipe uses that crate until after do_unpack, or at least not without expandurl.
>>>
>> The main question is if the metadata should contain all information. If yes, we shouldn't allow any fetcher which requires an external source. This should include the gitsm fetcher and we should replace the single SRC_URI with multiple git SRC_URIs.
>
> If we had tooling that supported that well we could certainly consider it. It isn't straightforward as you can have a git repo containing submodules which then themselves contain submodules which can then contain more levels of submodules. There are therefore multiple levels of expansion possible.

Okay. That makes the git submodule special in comparison to the other dependency fetchers.

>> We can go even further and forbid specific package manager fetchers and use plain https or git SRC_URIs. The python and go-vendor fetchers use this approach.
>>
>> Alternatively we allow dependency fetchers and require that the metadata always be used via bitbake. In this case we could extend the metadata via the fetcher.
>>
>> In both cases it is possible to produce the same metadata. It doesn't matter if we use recipetool, devtool, bbclasses or fetchers. In any case we could resolve the SRC_URIs, checksums or srcrev from a file. The license information could be fetched from the package repositories without integrity checks or could be extracted from the individual package description file inside the downloaded sources (ex. npm). We should skip the license detection from license files for now because it generates manual work and could be discussed later.
> That was the reason the current task based approach doesn't use them, yet! I mention it just to highlight that it can be solved either way; the approach doesn't really change what we need to do. The bigger concern is having information available in the metadata, which I think we need to do to some level regardless of which approach we choose.
>
>> The recipe approach has the advantage that it uses fixed licenses and that license changes could (theoretically) be reviewed during recipe update.
> FWIW that is an important use case and one of our general strengths. We can only do that as the license information is written in recipes and can be compared at update time.

Does this apply to the license of every individual dependency or only to the combined license?

>> In contrast the fetcher approach reduces the update procedure to a simple file rename or SRCREV update (ex. gitsm). Furthermore, the user could simply place a file beside the recipe to update the dependencies. Could we realize the same via devtool integration and a patch?
> This is effectively what the task based approach is aiming for currently. I think the idea was that we could have devtool/recipetool integration around that update task; a task was just a convenient way to capture the code to do it and get things working without needing the tool to be finished.

What is the task based approach? `bitbake -c update xyz`?

>> We have different solutions between the languages (ex. npmsw vs crate vs pypi) and even inside the languages (ex. go-vendor vs gomod). I would like to unify the dependency support. It doesn't matter if we decide to use the bitbake fetcher or a bitbake / devtool command for the dependency and license resolution.
> I do very much prefer having one good way of doing things rather than multiple ways of doing things, each with a potential drawback. I'm therefore broadly in favour of doing that as long as we don't upset too much existing mindshare along the way.

Okay

>>>> I have a WIP to integrate the dependencies into the SPDX. This uses the expanded_urldata / implicit_urldata function to add the dependencies to the process list of archiver and spdx.
>>>>
>>>> https://github.com/weidmueller/poky/tree/feature/dependency-fetcher
>>>>
>>>> Regarding the license we could migrate the functionality from recipetool into a class and detect the licenses at build time. Theoretically the fetcher could fetch the license from the package manager repository, but we have to trust the repository because we have no checksum to detect changes. Maybe we could integrate tools like Syft or ScanCode to detect the licenses at build time. At the moment the best solution is to make sure that the SBOM contains the name and version of the dependencies and let other tools handle the license via SBOM for now. Therefore I propose a common scheme to define the dependency name (dn) and version (dv) in the SRC_URI.
>>>
>>> We could compare what licenses the package manager is showing us with what is in the recipe and error if different. There would then need to be a command to update the licenses in the recipe (in much the way urls currently get updated).
>>>
>> Either we request the licenses from the package manager during package update or during fetch. I wouldn't do both. Instead I would analyze the license file during build and compare the detected license with the recipe or fetcher generated licenses. But the license detection from files is another topic and I would like to postpone it for now.
> Agreed, I mention it just to highlight that supporting them does have impact on the design, so any solution needs to ultimately be able to support it.

>>>>> You're using DL_DIR for that which I suspect isn't a great idea for tmp files.

>>>> Taken over from gitsm.

>>> Probably not the best fetcher and I'd say gitsm should be fixed.

>> I don't see a reason why the gitsm fetcher shouldn't be handled like the other dependency fetchers. We could update the handler after we have a decision for the dependency fetchers.

> In principle perhaps but as mentioned above, gitsm has its own challenges.

Based on your feedback I have the feeling that a dependency fetcher isn't the correct solution. The fetcher makes it impossible to review changes during recipe update. Additionally it needs caching for the resolved fetch and license data.

The alternative is to create an inc file with SRC_URIs, checksums, SRCREVs and LICENSE. Any recommendation on how to integrate the dependency resolution and inc creation into oe-core?

>>>>> The url scheme is clever but also has a potential risk in that you can't really pass parameters to both the top level fetcher and the underlying one. I'm worried that is going to bite us further down the line.

>>>> At the moment I don't see a real problem but maybe you are right. The existing language specific fetchers use fixed paths for their downloads.
>>>>
>>>> What do you propose? Should the fetcher skip the unpack of the source, or should we introduce a sub fetcher which uses the download from another SRC_URI entry? The two entries could be linked via the name parameter. This approach could be combined with your suggestion above. The new fetcher would unpack a lock file from another (default) download.

>>> I'm not really sure what is best right now. I'm trying to spell out the pros/cons of what is going on here in the hope it encourages others to give feedback as well. I agree there isn't a problem right now, but I worry there soon will be by mixing two things together like this. The way we handle git protocol does cause us friction with other url schemes already.

>> The dependency fetcher could simply skip the unpack. In this case the user needs to use a variable to pass the same URL to the git and dependency fetcher, or we could provide a python function to generate two SRC_URI entries with the same base URL.

> I'm starting to wonder about a slightly different approach, basically an optional generated file alongside a recipe which contains "expanded" information which is effectively expensive to generate (in computation or resource like network access/process terms). We could teach bitbake a new phase of parsing where it generated them if missing. There are some other pieces of information which we know during the build process which it would be helpful to know earlier (e.g. which packages a recipe generates). I've wondered about this for a long time and the fetcher issues remind me of it again. It would be a big change with advantages and drawbacks. I think it would put more pressure on a layer maintainer as they'd have to computationally keep this up to date and it would complicate the patch workflow (who should send/regen the files?). I'm putting the idea there, I'm not saying I think we should do it, I'm just considering options.

Do you mean like a cache or like the inc files? Is the file totally auto-generated or is manual editing acceptable?

>> = Open questions
>>
>>>>>> * Where should we download dependencies?
>>>>>> ** Should we use a folder per fetcher (ex. git and npm)?
>>>>>> ** Should we use the main folder (ex. crate)?
>>>>>> ** Should we translate the name into a folder (ex. gomod)?
>>>>>> ** Should we integrate the name into the filename (ex. git)?
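To make the download-layout question above concrete, the per-fetcher conventions described in this thread look roughly like this. This is a simplified sketch reconstructed from the discussion rather than from the code; the `git2/` and `npm2/` directory names are the ones current bitbake uses, the rest is illustrative:

```text
DL_DIR/
├── git2/<escaped-url>/          # git fetcher: own folder, URL mapped to one dir name
├── npm2/<name>-<ver>.tgz        # npm fetcher: own folder
├── <name>-<ver>.crate           # crate fetcher: prefixed files in the top level
└── example.com/mod/@v/v1.0.zip  # gomod fetcher: URL mapped to nested folders
```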
>>>>> DL_DIR is meant to be a complete cache of the source so it would need to be downloaded there. Given it maps to the other fetchers, the existing cache mechanisms likely work for these just fine, the open question is on whether the lock/spec files should be cached after extraction.

>>>> You misunderstand the question. It's about the downloadfilename parameter. At the moment some fetchers use a sub folder inside DL_DIR and others use the main folder. It looks like every fetcher has its own concept to handle file collisions between different fetchers. The git and npm fetchers use their own folder, the crate fetcher uses its own .crate file prefix, the gomod fetcher translates the URL into multiple folders and the git fetcher translates the URL into a single folder name.

>>> That makes more sense. The layout is partially legacy. The wget and local fetchers were first and hence go directly into DL_DIR. git/svn were separated out into their own directories with a plan to have a directory per fetcher. That didn't always work out with each newer fetcher. Each fetcher does have to handle a unique naming of its urls as only the specific fetcher can know all the url parameters and which ones affect the output vs which ones don't.

>> This doesn't explain why the npm but not the gomod and crate fetcher use a sub folder. All fetchers are based on the wget fetcher.

> That is probably "my fault". Put yourself in my position. You get a ton of different patches, all touching very varied aspects of the system. When reviewing them you have to try and remember the original design decisions, the future directions, the ways things broke in the past, a desire to try and have clean consistent APIs and so on. I have tried very hard to move things in a direction where things incrementally improve, without unnecessarily blocking new features. It means that things that merge often aren't perfect. We've tried a few different approaches with the newer programming languages and each approach has had pros and cons. The inconsistency is probably as I missed something in review. Sorry :(.

Sorry, I don't mean to criticize you. I see that you have a lot of work. I want to understand the reasons for the actual design and what it should look like.

> I only have finite time. There are few people who seem to want to dive in and help with review of patches like these. I did ask some people yesterday, one told me they simply couldn't understand these patches.

What can I do to improve the review?

> I'm doing my best to ask the right questions, try and help others understand them, ensure my own concerns I can identify are resolved and I don't want to de-motivate you on this work either, I think the idea of improving this is great and I'd love to see it. Equally, I'm also the first person everyone will complain to if we change something and it causes problems for people.
>
> So the explanation is probably I just missed something in review at some point. The intent was to separate out the fetcher output going forward (unless it makes sense to be shared).
>
> FWIW there are multiple things which bother me about the existing fetcher storage layout but that is a different discussion.

Okay.

>>>>>> * Where should we unpack the dependencies?
>>>>>> ** Should we use a folder inside the parent folder (ex. node_modules)?
>>>>>> ** Should we use a fixed folder inside unpackdir (ex. go/pkg/mod/cache/download and cargo_home/bitbake)?

>>>>> This likely depends on the fetcher as the different mechanisms will have different expectations about how they should be extracted (as npm/etc. would).

>>>> It depends on the fetcher, but the fetchers could use the same approach. At the moment every fetcher uses a different approach. The crate fetcher uses a fixed value. The gomod fetcher uses a variable (GO_MOD_CACHE_DIR) and the npm fetcher uses a parameter (destsuffix). Furthermore the gomod fetcher overrides the common subdir parameter.

>>> I think we really need to standardise that if we can. Each new fetcher has claimed a certain approach is effectively required by the package manager.

What would be your desired solution? Is the variable okay or do you prefer a self-contained SRC_URI?

> I suspect we need a default via a variable and then the option to change the default via parameters. The default value should be a bitbake fetcher namespaced control variable.
>
> I'm wary of making a definitive statement saying X if that isn't going to make sense for some backend though. I simply don't have enough knowledge of them all, which is why you see me being reluctant to make definitive statements about design.

Okay.

>>>>>> * How should we treat archives for package manager caches?
>>>>>> ** Should we unpack the archives to support patching (ex. npm)?
>>>>>> ** Should we copy the packed archive to avoid unpacking and packaging (ex. gomod)?

>>>>> If there are archives left after do_unpack, which task is going to unpack those? Are we expecting the build process in configure/compile to decompress them? Would those management tools accept things if they were extracted earlier? "unpack" would be the correct time to do it but I can see this getting into conflict with the package manager :/.

>>>> Most package managers expect archives. In the npm case the archive is unpacked by the fetcher and packed by the npm.bbclass to support patching. The gomod fetcher doesn't unpack the downloaded archive and the gomodgit fetcher creates archives from git folders during unpack. It would be possible to always keep the archives, or always extract the archives and recreate archives during build.
>>>> It is a decision between performance and patchability.
>>>>
>>>> At the moment it is complicated to work with the different fetchers because every fetcher uses a different concept and it is unclear what the desired approach is.

>>> This is a challenge. Can we handle the unpacking with the package manager as a specific step or does it have to be combined with other steps like configure/compile?

>> It looks like this is possible:
>> cargo fetch
>> go mod vendor
>> npm install
>>
>> I suspect you're thinking about using the package manager in do_unpack to unpack the archives and patch the unpacked archives afterwards?

> I'm wondering about it, yes. I know we've had challenges with patching rust modules for example so this isn't a theoretical problem :/.

It is an interesting idea because most package managers check the integrity before unpacking. Additionally it should simplify and speed up the npm build because it removes the repacking of the packages. The problem is that we need an additional task to patch the dependency specification file and to unpack the files.

>>>>> I did wonder if patches 1-5 of this series could be merged separately too as they look reasonable regardless of the rest of the series?

>>>> Sure. Should I resend the patches as a separate series?

>>> Yes please, that would then let us remove the bits we can easily review/sort and focus on this other part.

>> Done.

> Thanks.

>> I will also resend the go h1 checksum commit separately because it could be useful for the gomod fetcher.

> Yes, I was waiting for a new version of that one with the naming tweaked.

Done.

>> Should I also move the dn / dv parameter patches to a separate series because they could be useful without the dependency fetcher? I could add the parameters to the fetchers in a backward compatible way.

> I need to think more about that one...
The motivation is to include the dependencies with name, version, license and CPE in the SBOM.

Regards
Stefan
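As an illustration of the dn / dv scheme proposed in this thread (the package name, version and registry URL here are invented for the example, and the parameter names are Stefan's proposal, not an existing bitbake feature):

```bitbake
# Hypothetical: annotate a resolved dependency with its name and version
SRC_URI = "https://registry.npmjs.org/lodash/-/lodash-4.17.21.tgz;dn=lodash;dv=4.17.21"
```

SBOM generation could then take each dependency's name and version directly from the URL parameters instead of re-parsing the lock file.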
Hi all,

I'm going to reply at this point in the thread to at least let everyone know that I've been reading along, but honestly can't say if a few questions that I have have been asked (and answered).

The biggest use case that I have for the layers and recipes that I maintain is about being able to both "easily" patch or update vendor/dependencies of the main application build.

It was unclear to me how I'd do that with these changes.

For the copied/extracted dependencies, I can see that you'd just be able to figure out where they were extracted (and I see the discussions on where to extract/store some of the files) and then write a patch as you would with any recipe. But would there be a way to patch the dependency "lock file"? I definitely don't see a way that I'd be able to tweak a source hash and have an updated dependency pulled in .. but I could have easily missed that.

Those are the primary reasons why I'll stay with explicitly listed / visible dependencies, unless something similar is available in a re-worked / unified fetcher.

I prefer the translation to git, so I have debug source for vendor dependencies as well as a well travelled path to mirror and archive the source, but something like the update task of rust is at least explicit and visible to me, so I can also use it without too many issues.

Bruce

On Tue, Jan 7, 2025 at 11:13 AM Stefan Herbrechtsmeier via lists.openembedded.org <stefan.herbrechtsmeier-oss=weidmueller.com@lists.openembedded.org> wrote:

> Am 07.01.2025 um 12:01 schrieb Richard Purdie:
> On Tue, 2025-01-07 at 10:47 +0100, Stefan Herbrechtsmeier wrote:
> Am 06.01.2025 um 16:30 schrieb Richard Purdie:
> On Mon, 2025-01-06 at 15:35 +0100, Stefan Herbrechtsmeier wrote:
>
> I'm a little bit worried about how easily you could sneak a "floating" version into this and make the fetcher non-deterministic. Does (or could?) the code detect and error on that?
> We could raise an error if a checksum is missing in the dependency specification file or make the checksum mandatory for the dependency fetcher. Furthermore we could inspect the dependency URLs to detect a misuse of the file like a latest string for the version.
>
> I think adding such an error would be a requirement for merging this.
>
> Should the dependency fetcher (ex. npmsw) or the language specific fetcher (ex. npm) fail if the version points to a latest version?
>
> I think right now it has to error to try and reduce complexity. It is possible to support such things but you have to pass that version information back up the stack so that PV represents the different versions and that is a new level of complexity.
>
> I guess we should consider how you could theoretically support it as that might influence the design. With multiple git repos in SRC_URI for example, we end up adding multiple shortened shas to construct a PV so that if any change, PV changes. We also have to add an incrementing integer so that opkg/dpkg/rpm operations work and versions sort.
>
> Okay. In this case we should add the checks to the dependency resolution. Thereby we prohibit dynamic versions for the dependencies and allow users to add support for it to the fetcher of the package manager.
>
> Put another way, could one of these SRC_URIs map to multiple different combinations of underlying component versions?
>
> If you mean the extracted SRC_URI for a single dependency from the dependency specification file (ex. npm-shrinkwrap.json) it could use special URLs to map to the latest version. But this is a misuse of the dependency specification file and could be detected. The tools always generate files with fixed versions because a floating version with a fixed checksum makes no sense.
>
> Even if it shouldn't happen, we need to detect and error for this case as it would become very problematic for us.
>
> Okay. Should we disallow a dynamic version for package manager downloads generally or do you see a reasonable use case?
>
> See above.
>
> I also thought it would make sense to generate recipes from the dependency specification files and therefore worked on the recipetool previously. But it looks like the tool isn't really used and I'm afraid nobody will use the recipe to fix dependencies. In most cases it is easy to update a dependency in the native tooling and only provide an updated dependency specification file.
>
> I think people have wanted a single simple command to translate the specification file into our recipe format to update the recipe. For various reasons people didn't seem to find the recipetool approach was working and created the task workflow based one. There are pros and cons to both and I don't have a strong preference. I would like to see something which makes it clear to users what is going on though and is simple to use.
>
> People do intuitively understand a .inc file with a list of urls in it. There are challenges in updating it.
>
> This other approach is not as intuitive as everything is abstracted out of sight.
>
> One thing for example which worries me is how are the license fields in the recipe going to be updated?
>
> Currently, if we teach the class, it can set LICENSE variables appropriately. With the new approach, you don't know the licenses until after unpack has run. Yes it can write it into the SPDX, but it won't work for something like the layer index or forms of analysis which don't build things.
>
> This does also extend to vulnerability analysis since we can't know what is in a given recipe without actually unpacking it. For example we could know crate XXX at version YYY has a CVE but we can't tell if a recipe uses that crate until after do_unpack, or at least not without expandurl.
>
> The main question is if the meta data should contain all information.
> If yes, we shouldn't allow any fetcher which requires an external > source. This should include the gitsm fetcher and we should replace > the single SRC_URI with multiple git SRC_URIs. > > If we had tooling that supported that well we could certainly consider > it. It isn't straight forward as you can have a git repo containing > submodules which then themselves contain submodules which can then > contain more levels of submodules. There are therefore multiple levels > of expansion possible. > > Okay. That makes the git submodule special in compare to the other > dependency fetcher. > > We can go even further and forbid specific package manager fetchers > and use plain https or git SRC_URIs. The python and go-vendor fetcher > use this approach. > > Alternative we allow dependency fetchers and require that the meta > data be always used via bitbake. In this case we could extend the > meta data via the fetcher. > > In both cases it is possible to produce the same meta data. It > doesn't matter if we use recipetool, devtool, bbclasses or fetcher. > In any case we could resolve the SRC_URIs, checksums or srcrev from a > file. The license information could be fetched from the package > repositories without integrity checks or could be extracted from the > individual package description file inside the downloaded sources > (ex. npm). We should skip the license detection from license files > for now because they generate manual work and could be discuses > later. > > That was the reason the current task based approach doesn't use them, > yet! I mention it just to highlight that it can be solved either way, > the approach doesn't really change what we need to do. The bigger > concern is having information available in the metadata which I think > we need do to some level regardless of which approach we choose. > > > The recipe approach has the advantage that it uses fixed licenses and > that license changes could be (theoretical) reviewed during recipe > update. 
> > FWIW that is an important use case and one of our general strengths. We > can only do that as the license information is written in recipes and > can be compared at update time. > > Does this apply to the license of the every individual dependency or only > to the combined license? > > In contrast the fetcher approach reduces the update procedure to a > simple file rename or SRCREV update (ex. gitsm). Furthermore, the > user could simply place a file beside the recipe to update the > dependencies. Could we realize the same via devtool integration and a > patch? > > This is effectively what the task based approach is aiming for > currently. I think the idea was that we could have devtool/recipetool > integration around that update task, a task was just a convenient way > to capture the code to do it and get things working without needing the > tool to be finished. > > What is the task based approach? `bitbake -c update xyz`? > > We have different solutions between the languages (ex. npmsw vs > crate vs pypi) and even inside the languages (ex. go-vendor vs > gomod). I would like to unify the dependency support. It doesn't > matter if we decide to use the bitbake fetcher or a bitbake / devtool > command for the dependency and license resolution. > > I do very much prefer having one good way of doing things rather than > multiple ways of doing things, each with a potential drawback. I'm > therefore broadly in favour of doing that as long as we don't upset too > much existing mindshare along the way. > > Okay > > > I have a WIP to integrate the the dependencies into the spdx . > This > uses the expanded_urldata / implicit_urldata function to add the > dependencies to the process list of archiver and spdx. > https://github.com/weidmueller/poky/tree/feature/dependency- > fetcher > > Regarding the license we could migrate the functionality from > recipetool into a class and detect the licenses at build time. 
> Theoretically the fetcher could fetch the license from the > package > manager repository but we have to trust the repository because we > have no checksum to detect changes. Maybe we could integrate > tools > like Syft or ScanCode to detect the licenses at build time. At > the > moment the best solution is to make sure that the SBOM contains > the > name and version of the dependencies and let other tools handle > the > license via SBOM for now. Therefore I propose a common scheme to > define the dependency name (dn) and version (dv) in the SRC_URI. > > > > We could compare what licenses the package manager is showing us > with > what is in the recipe and error if different. There would then need > to > be a command to update the licenses in the recipe (in much the way > urls > currently get updated). > > > > Either we request the licenses from the package manager during > package update or during fetch. I wouldn't do both. Instead I would > analyze the the license file during build and compare the detected > license with the recipe or fetcher generated licenses. But the > license detection from files is an other topic and I would like to > postpone it for now. > > Agreed, I mention it just to highlight that supporting them does have > impact on the design so any solution needs to ultimately be able to > support it. > > > You're using DL_DIR for that which I > suspect isn't a great idea for tmp files. > > Take over from gitsm. > > Probably not the best fetcher and I'd say gitsm should be fixed. > > I don't see a reason why the gitsm fetcher shouldn't handled like the > other dependency fetcher. We could update the handler after we have a > decision for the dependency fetchers. > > In principle perhaps but as mentioned above, gitsm has its own challenges. > > Based on your feedback I have the feeling that a dependency fetcher isn't > the correct solution. The fetcher makes it impossible to review changes > during recipe update. 
Additionally it needs caching for the resolved fetch > and license data. > > The alternative is to create an inc file with SRC_URIs, checksums, SRCREVs > and LICENSE. Any recommendation how to integrate the dependency resolution > and inc creation into oe-core? > > The url scheme is clever but also has a potential risk in that you > can't really pass parameters to both the top level fetcher and the > underlying one. I'm worried that is going to bite us further down > the > line. > > At the moment I don't see a real problem but maybe you are right. The > existing language specific fetcher use fixed paths for there > downloads. > > What do you propose? Should the fetcher skip the unpack of the > source or should we introduce a sub fetcher which uses the download > from an other SRC_URI entry. The two entries could be linked via the > name parameter. This approach could be combined with your suggestion > above. The new fetcher will unpack a lock file from an other > (default) download. > > > I'm not really sure what is best right now. I'm trying to spell out the > pros/cons of what is going on here in the hope it encourages others to > give feedback as well. I agree there isn't a problem right now but I > worry there soon will be by mixing two things together like this. The > way we handle git protocol does cause us friction with other urls > schemes already. > > The dependency fetcher could simple skip the unpack. In this case the > user needs to use a variable to pass the same URL to the git and > dependency fetcher or we could provide a python function to generate > two SRC_URI with the same base URL. > > > I'm starting to wonder about a slightly different approach, basically > an optional generated file alongside a recipe which contains "expanded" > information which is effectively expensive to generate (in computation > or resource like network access/process terms). We could teach bitbake > a new phase of parsing where it generated them if missing. 
There are > some other pieces of information which we know during the build process > which it would be helpful to know earlier (e.g. which packages a recipe > generates). I've wondered about this for a long time and the fetcher > issues remind me of it again. It would be a big change with advantages > and drawbacks. I think it would put more pressure on a layer maintainer > as they'd have to computationally keep this up to date and it would > complicate the patch workflow (who should send/regen the files?). I'm > putting the idea there, I'm not saying I think we should do it, I'm > just considering options. > > Do you mean like a cache or like the inc files? Is the file totally auto > generated or is manual editing acceptable? > > = Open questions > > * Where should we download dependencies? > ** Should we use a folder per fetcher (ex. git and npm)? > ** Should we use the main folder (ex. crate)? > ** Should we translate the name into folder (ex. gomod)? > ** Should we integrate the name into the filename (ex. git)? > > > > > > DL_DIR is meant to be a complete cache of the source so it would > need > to be downloaded there. Given it maps to the other fetchers, the > existing cache mechanisms likely work for these just fine, the open > question is on whether the lock/spec files should be cached after > extraction. > > > You misunderstand the question. Its about the downloadfilename > parameter. At the moment some fetcher use sub folder inside DL_DIR > and others use the main folder. It looks like every fetcher has its > own concept to handle file collision between different fetchers. The > git and npm fetcher use there own folder, the crate fetcher use its > own .crate file prefix, the gomod fetcher translate the URL into > multiple folders and the git fetcher translate the URL into a single > folder name. > > That makes more sense. The layout is partially legacy. The wget and > local fetchers were first and hence go directly into DL_DIR. 
git/svn > were separated out into their own directories with a plan to have a > directory per fetcher. That didn't always work out with each newer > fetcher. Each fetcher does have to handle a unique naming of its urls > as only the specific fetcher can know all the urls parameters and which > ones affect the output vs which ones don't. > > > This doesn't explain why the npm but not the gomod and crate fetcher > use a sub folder. All fetchers are based on the wget fetcher. > > That is probably "my fault". Put yourself in my position. You get a ton > of different patches, all touching very varied aspects of the system. > When reviewing them you have to try and remember the original design > decisions, the future directions, the ways things broke in the past, a > desire to try and have clean consistent APIs and so on. I have tried > very hard to move things in a direction where things incrementally > improve, without unnecessarily blocking new features. It means that > things that merge often aren't perfect. We've tried a few different > approaches with the newer programming languages and each approach has > had pros and cons. The inconsistency is probably as I missed something > in review. Sorry :(. > > Sorry, I don't want to criticism you. I see that you have a lot of work. I > want to understand the reasons for the actual design and how it should look > like. > > I only have finite time. There are few people who seem to want to dive > in and help with review of patches like these. I did ask some people > yesterday, one told me they simply couldn't understand these patches. > > What can I do to improve the review? > > I'm doing my best to ask the right questions, try and help others > understand them, ensure my own concerns I can identify are resolved and > I don't want to de-motivate you on this work either, I think the idea > of improving this is great and I'd love to see it. 
Equally, I'm also > the first person everyone will complain to if we change something and > it causes problems for people. > > So the explanation is probably I just missed something in review at > some point. The intent was to separate out the fetcher output going > forward (unless it makes sense to be shared). > > FWIW there are multiple things which bother me about the existing > fetcher storage layout but that is a different discussion. > > Okay. > > * Where should we unpack the dependencies? > ** Should we use a folder inside the parent folder (ex. > node_modules)? > ** Should we use a fixed folder inside unpackdir > (ex. go/pkg/mod/cache/download and cargo_home/bitbake)? > > > This likely depends on the fetcher as the different mechanisms will > have different expectations about how they should be extracted (as > npm/etc. would). > > > It depends on the fetcher but the fetcher could use the same > approach. At the moment every fetcher use a different approach. The > crate fetcher use a fixed value. The gomod fetcher uses a variable > (GO_MOD_CACHE_DIR) and the npm fetcher uses a parameter (destsuffix). > Furthermore the gomod fetcher override the common subdir parameter. > > I think we really need to standardise that if we can. Each new fetcher > has claimed a certain approach is effectively required by the package > manager. > > What would be your desired solution? Is the variable okay or do you prefer a self contain SRC_URI? > > I suspect we need a default via a variable and then the option to > change the default via parameters. The default value should be a > bitbake fetcher namespaced control variable. > > I'm wary of making a definitive statement saying X if that isn't going > to make sense for some backend though. I simply don't have enough > knowledge of them all, which is why you see me being reluctant to make > definitive statements about design. > > Okay. > > * How should we treat archives for package manager caches? 
> ** Should we unpack the archives to support patching (ex. npm)? > ** Should we copy the packed archive to avoid unpacking and > packaging > (ex. gomod)? > > > If there are archives left after do_unpack, which task is going > to unpack those? Are we expecting the build process in > configure/compile to decompress them? Would those management > tools accept things if they were extracted earlier? "unpack" > would be the correct time to do it but I can see this getting > into conflict with the package manager :/. > > > Most package manager expect archives. In the npm case the archive is > unpack by the fetcher and packed by thenpm.bbclass to support > patching. The gomod fetcher doesn't unpack the downloaded archive and > the gomodgit fetcher create archives from git folders during unpack. > It would be possible to always keep the archives or always extract > the archives and recreate archives during build. It is a decision > between performance and patchability. > > At the moment it is complicated to work with the different fetcher > because every fetcher use a different concept and it is unclear what > is the desired approach. > > > This is a challenge. Can we handle the unpacking with the package > manager as a specific step or does it have to be combined with other > steps like configure/compile? > > > It looks like this is possible: > cargo fetch > go mod vendor > npm install > > I suspect you're thinking about using the package manager in > do_unpack to unpack the archives and patch the unpacked archives > afterwards? > > I'm wondering about it, yes. I know we've had challenges with patching > rust modules for example so this isn't a theoretical problem :/. > > It is an interesting idea because most package manager check the integrity > before unpack. Additionally it should simplify and speed up the npm build > because it removes the repack of the packages. 
The problem is that we need > an additional task to patch the dependency specification file and to unpack > the file. > > I did wonder if patches 1-5 of this series could be merged > separately too as they look reasonable regardless of the rest > of the series? > > > Sure. Should I resend the patches as a separate series? > > Yes please, that would then let us remove the bits we can easily > review/sort and focus on this other part. > > > Done. > > Thanks. > > > I will also resend the go h1 checksum commit separately because it > could be useful for the gomod fetcher. > > Yes, I was waiting for a new version of that one with the naming tweaked. > > Done. > > Should I also move the dn / dv parameter patches to a separate series > because they could be useful without the dependency fetcher? I could > add the parameters to the fetchers in a backward-compatible way. > > I need to think more about that one... > > The motivation is to include the dependencies with name, version, license > and cpe into the SBOM. > > Regards > Stefan > > > -=-=-=-=-=-=-=-=-=-=-=- > Links: You receive all messages sent to this group. > View/Reply Online (#16981): > https://lists.openembedded.org/g/bitbake-devel/message/16981 > Mute This Topic: https://lists.openembedded.org/mt/110212697/1050810 > Group Owner: bitbake-devel+owner@lists.openembedded.org > Unsubscribe: https://lists.openembedded.org/g/bitbake-devel/unsub [ > bruce.ashfield@gmail.com] > -=-=-=-=-=-=-=-=-=-=-=- > >
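As background to the SBOM motivation above: the proposed dn / dv scheme annotates each dependency URL with the name and version the SBOM should record, roughly like this (the URL and values are illustrative, not taken from the series):

```
# Hypothetical dependency URL carrying SBOM name/version annotations
SRC_URI += "https://registry.npmjs.org/accepts/-/accepts-1.3.8.tgz;dn=accepts;dv=1.3.8"
```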
Am 07.01.2025 um 17:58 schrieb Bruce Ashfield: > Hi all, > > I'm going to reply at this point in the thread to at least let > everyone know that I've been reading along, but honestly can't say if > a few questions that I have have been asked (and answered). > > The biggest use case that I have for the layers and recipes that I > maintain is about being able to both "easily" patch or update > vendor/dependencies of the main application build. > > It was unclear to me how I'd do that with these changes. > > For the copied/extracted dependencies, I can see that you'd just be > able to figure out where they were extracted (and I see the > discussions on where to extract/store some of the files) and then > write a patch as you would with any recipe. But would there be a way > to patch the dependency "lock file" ? I definitely don't see a way > that I'd be able to tweak a source hash and have an updated dependency > pulled in .. but I could have easily missed that. You have to provide your own "lock file" and place it beside the recipe. The "lock file" is fetched via the file fetcher and is used to fetch the dependencies. > Those are the primary reasons why I'll stay with explicitly listed / > visible dependencies, unless something similar is available in a > re-worked / unified fetcher. It is impossible to patch the sources inside the bitbake fetcher. Therefore the dependency resolution must be moved inside a dependency fetch task and an additional dependency patch task needs to be added. > I prefer the translation to git, so I have debug source for vendor > dependencies as well as a well travelled path to mirror and archive > the source Are you referring to the go-vendor implementation? Do you mean the vendor directory? The gomod fetcher should support mirroring and archiving the sources. It should be possible to create a vendor folder from the gomod archives.
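To make the lock file suggestion above concrete: overriding the dependencies comes down to shipping your own lock file next to the recipe, following the syntax from the cover letter. A sketch for the npm shrinkwrap case:

```
# npm-shrinkwrap.json sits beside the recipe and pins every dependency;
# stage one of the fetcher locates it, stage two fetches the entries.
SRC_URI = "npmsw://npm-shrinkwrap.json"
```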
> , but something like the update task of rust is at least explicit and > visible to me, so I can also use it without too many issues. Do you mean `bitbake -c update_crates recipe-name`? Regards Stefan
On Tue, Jan 7, 2025 at 12:46 PM Stefan Herbrechtsmeier < stefan.herbrechtsmeier-oss@weidmueller.com> wrote: > Am 07.01.2025 um 17:58 schrieb Bruce Ashfield: > > Hi all, > > I'm going to reply at this point in the thread to at least let everyone > know that I've been reading along, but honestly can't say if a few > questions that I have have been asked (and answered). > > The biggest use case that I have for the layers and recipes that I > maintain is about being able to both "easily" patch or update > vendor/dependencies of the main application build. > > It was unclear to me how I'd do that with these changes. > > For the copied/extracted dependencies, I can see that you'd just be able > to figure out where they were extracted (and I see the discussions on where > to extract/store some of the files) and then write a patch as you would > with any recipe. But would there be a way to patch the dependency "lock > file" ? I definitely don't see a way that I'd be able to tweak a source > hash and have an updated dependency pulled in .. but I could have easily > missed that. > > You have to provide your own "lock file" and place it beside the recipe. > The "lock file" is fetched via the file fetcher and is used to fetch the > dependencies. > > My requirement would be to individually bump the vendored dependencies. A copy and update of just a single entry in the lock file is possible, which is what I'd do. I'm just pointing out that finer grained control is required when quickly iterating or developing packages. I find a lot of mindshare goes towards just building and creating images, where there's also a need to support development workflows. > Those are the primary reasons why I'll stay with explicitly listed / > visible dependencies, unless something similar is available in a re-worked > / unified fetcher. > > It is impossible to patch the sources inside bitbake. 
Therefore the > dependency resolution must be moved inside a dependency fetch task and an > additional dependency patch task needs to be added. > I'm just talking about being able to patch the vendor sources once they are fetched and placed in their build location. Using normal patch files on the SRC_URI. When the location of the vendor source isn't obvious (because it is calculated or dynamically generated), this becomes more challenging. > I prefer the translation to git, so I have debug source for vendor > dependencies as well as a well travelled path to mirror and archive the > source > > Are you referring to the go-vendor implementation? Do you mean the vendor > directory? The gomod fetcher should support mirroring and archiving the sources. > It should be possible to create a vendor folder from the gomod archives. > Nope. I don't use that either. I have my own tools to locate the source of the dependencies, clone and put them into a vendor directory. The recipe simply clones and copies using git after that. > , but something like the update task of rust is at least explicit and > visible to me, so I can also use it without too many issues. > > Do you mean `bitbake -c update_crates recipe-name`? > Correct. The .inc file updating mechanisms. Bruce > > Regards > Stefan > > On Tue, Jan 7, 2025 at 11:13 AM Stefan Herbrechtsmeier via > lists.openembedded.org <stefan.herbrechtsmeier-oss= > weidmueller.com@lists.openembedded.org> wrote: > >> Am 07.01.2025 um 12:01 schrieb Richard Purdie: >> >> On Tue, 2025-01-07 at 10:47 +0100, Stefan Herbrechtsmeier wrote: >> >> Am 06.01.2025 um 16:30 schrieb Richard Purdie: >> >> On Mon, 2025-01-06 at 15:35 +0100, Stefan Herbrechtsmeier wrote: >> >> I'm a little bit worried about how easily you could sneak a >> "floating" version into this and make the fetcher >> non-deterministic. Does (or could?) the code detect and error on >> that?
>> >> >> We could raise an error if a checksum is missing in the >> dependency specification file or make the checksum mandatory for >> the dependency fetcher. Furthermore we could inspect the >> dependency URLs to detect a misuse of the file like a latest >> string for the version. >> >> >> I think adding such an error would be a requirement for merging >> this. >> >> >> Should the dependency fetcher (ex. npmsw) or the language specific >> fetcher (ex. npm) fail if the version points to a latest version? >> >> I think right now it has to error to try and reduce complexity. It is >> possible to support such things but you have to pass that version >> information back up the stack so that PV represents the different >> versions and that is a new level of complexity. >> >> I guess we should consider how you could theoretically support it as >> that might influence the design. With multiple git repos in SRC_URI for >> example, we end up adding multiple shortened shas to construct a PV so >> that if any change, PV changes. We also have to add an incrementing >> integer so that opkg/dpkg/rpm operations work and versions sort. >> >> Okay. In this case we should add the checks to the dependency resolution. >> Thereby we prohibit dynamic versions for the dependencies and allow users >> to add support for it to the fetcher of the package manager. >> >> Put another way, could one of these SRC_URIs map to multiple >> different combinations of underlying component versions? >> >> If you mean the extracted SRC_URI for a single dependency from >> the dependency specification file (ex. npm-shrinkwrap.json) it >> could use special URLs to map to the latest version. But this is >> a misuse of the dependency specification file and could be >> detected. The tools generate files with fixed versions always >> because a floating version with a fixed checksum makes no sense.
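The error being discussed could be a simple validation pass over the resolved entries before anything is fetched. A minimal sketch, assuming an npm-style lock entry shape; the check itself is hypothetical, not code from the series:

```python
# Sketch: refuse non-deterministic entries in a dependency lock file.
# An entry must carry an integrity checksum and a pinned version.

def check_locked(name, entry):
    """Return a list of error strings for one lock file entry."""
    errors = []
    if not entry.get("integrity"):
        errors.append("%s: missing integrity checksum" % name)
    version = entry.get("version", "")
    # Floating specifiers like "latest", "^1.2.3" or "~1.2" are not pinned.
    if version == "latest" or version[:1] in ("^", "~", ">", "<", "*"):
        errors.append("%s: floating version %r" % (name, version))
    return errors

print(check_locked("accepts", {"version": "1.3.8", "integrity": "sha512-..."}))
print(check_locked("lodash", {"version": "latest"}))
```

A fetcher could run such a check over every entry of the specification file and raise a fetch error on the first hit, making any non-determinism visible at do_fetch time rather than at build time.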
>> Even if it shouldn't happen, we need to detect and error for this case as it would become very problematic for us.

>> Okay. Should we disallow a dynamic version for package manager downloads generally or do you see a reasonable use case?

>> See above.

>> I also thought it would make sense to generate recipes from the dependency specification files and therefore worked on recipetool previously. But it looks like the tool isn't really used and I'm afraid nobody will use the recipe to fix dependencies. In most cases it is easy to update a dependency in the native tooling and only provide an updated dependency specification file.

>> I think people have wanted a single simple command to translate the specification file into our recipe format to update the recipe. For various reasons people didn't seem to find the recipetool approach was working and created the task-workflow-based one. There are pros and cons to both and I don't have a strong preference. I would like to see something which makes it clear to users what is going on though and is simple to use.

>> People do intuitively understand a .inc file with a list of urls in it. There are challenges in updating it.

>> This other approach is not as intuitive as everything is abstracted out of sight.

>> One thing for example which worries me is how are the license fields in the recipe going to be updated?

>> Currently, if we teach the class, it can set LICENSE variables appropriately. With the new approach, you don't know the licenses until after unpack has run. Yes it can write it into the SPDX, but it won't work for something like the layer index or forms of analysis which don't build things.

>> This does also extend to vulnerability analysis since we can't know what is in a given recipe without actually unpacking it.
>> For example we could know crate XXX at version YYY has a CVE but we can't tell if a recipe uses that crate until after do_unpack, or at least not without expandurl.

>> The main question is if the metadata should contain all information. If yes, we shouldn't allow any fetcher which requires an external source. This should include the gitsm fetcher and we should replace the single SRC_URI with multiple git SRC_URIs.

>> If we had tooling that supported that well we could certainly consider it. It isn't straightforward as you can have a git repo containing submodules which then themselves contain submodules which can then contain more levels of submodules. There are therefore multiple levels of expansion possible.

>> Okay. That makes the git submodule special compared to the other dependency fetchers.

>> We can go even further and forbid specific package manager fetchers and use plain https or git SRC_URIs. The python and go-vendor fetchers use this approach.

>> Alternatively, we allow dependency fetchers and require that the metadata always be used via bitbake. In this case we could extend the metadata via the fetcher.

>> In both cases it is possible to produce the same metadata. It doesn't matter if we use recipetool, devtool, bbclasses or fetchers. In any case we could resolve the SRC_URIs, checksums or SRCREVs from a file. The license information could be fetched from the package repositories without integrity checks or could be extracted from the individual package description file inside the downloaded sources (ex. npm). We should skip the license detection from license files for now because it generates manual work and could be discussed later.

>> That was the reason the current task-based approach doesn't use them, yet! I mention it just to highlight that it can be solved either way, the approach doesn't really change what we need to do.
>> The bigger concern is having information available in the metadata, which I think we need to do to some level regardless of which approach we choose.

>> The recipe approach has the advantage that it uses fixed licenses and that license changes could (theoretically) be reviewed during recipe update.

>> FWIW that is an important use case and one of our general strengths. We can only do that as the license information is written in recipes and can be compared at update time.

>> Does this apply to the license of every individual dependency or only to the combined license?

>> In contrast the fetcher approach reduces the update procedure to a simple file rename or SRCREV update (ex. gitsm). Furthermore, the user could simply place a file beside the recipe to update the dependencies. Could we realize the same via devtool integration and a patch?

>> This is effectively what the task-based approach is aiming for currently. I think the idea was that we could have devtool/recipetool integration around that update task; a task was just a convenient way to capture the code to do it and get things working without needing the tool to be finished.

>> What is the task-based approach? `bitbake -c update xyz`?

>> We have different solutions between the languages (ex. npmsw vs crate vs pypi) and even inside the languages (ex. go-vendor vs gomod). I would like to unify the dependency support. It doesn't matter if we decide to use the bitbake fetcher or a bitbake / devtool command for the dependency and license resolution.

>> I do very much prefer having one good way of doing things rather than multiple ways of doing things, each with a potential drawback. I'm therefore broadly in favour of doing that as long as we don't upset too much existing mindshare along the way.

>> Okay.

>> I have a WIP to integrate the dependencies into the SPDX.
>> This uses the expanded_urldata / implicit_urldata function to add the dependencies to the process list of archiver and spdx. https://github.com/weidmueller/poky/tree/feature/dependency-fetcher

>> Regarding the license we could migrate the functionality from recipetool into a class and detect the licenses at build time. Theoretically the fetcher could fetch the license from the package manager repository but we have to trust the repository because we have no checksum to detect changes. Maybe we could integrate tools like Syft or ScanCode to detect the licenses at build time. At the moment the best solution is to make sure that the SBOM contains the name and version of the dependencies and let other tools handle the license via SBOM for now. Therefore I propose a common scheme to define the dependency name (dn) and version (dv) in the SRC_URI.

>> We could compare what licenses the package manager is showing us with what is in the recipe and error if different. There would then need to be a command to update the licenses in the recipe (in much the way urls currently get updated).

>> Either we request the licenses from the package manager during package update or during fetch. I wouldn't do both. Instead I would analyze the license file during build and compare the detected license with the recipe or fetcher-generated licenses. But the license detection from files is another topic and I would like to postpone it for now.

>> Agreed, I mention it just to highlight that supporting them does have an impact on the design, so any solution needs to ultimately be able to support it.

>> You're using DL_DIR for that which I suspect isn't a great idea for tmp files.

>> Taken over from gitsm.

>> Probably not the best fetcher and I'd say gitsm should be fixed.
>> I don't see a reason why the gitsm fetcher shouldn't be handled like the other dependency fetchers. We could update the handler after we have a decision for the dependency fetchers.

>> In principle perhaps but as mentioned above, gitsm has its own challenges.

>> Based on your feedback I have the feeling that a dependency fetcher isn't the correct solution. The fetcher makes it impossible to review changes during recipe update. Additionally it needs caching for the resolved fetch and license data.

>> The alternative is to create an inc file with SRC_URIs, checksums, SRCREVs and LICENSE. Any recommendation on how to integrate the dependency resolution and inc creation into oe-core?

>> The url scheme is clever but also has a potential risk in that you can't really pass parameters to both the top-level fetcher and the underlying one. I'm worried that is going to bite us further down the line.

>> At the moment I don't see a real problem but maybe you are right. The existing language-specific fetchers use fixed paths for their downloads.

>> What do you propose? Should the fetcher skip the unpack of the source or should we introduce a sub-fetcher which uses the download from another SRC_URI entry? The two entries could be linked via the name parameter. This approach could be combined with your suggestion above. The new fetcher would unpack a lock file from another (default) download.

>> I'm not really sure what is best right now. I'm trying to spell out the pros/cons of what is going on here in the hope it encourages others to give feedback as well. I agree there isn't a problem right now but I worry there soon will be by mixing two things together like this. The way we handle the git protocol does cause us friction with other URL schemes already.

>> The dependency fetcher could simply skip the unpack.
In this case the user needs to use a variable to pass the same URL to the git and dependency fetchers, or we could provide a python function to generate two SRC_URIs with the same base URL.

>> I'm starting to wonder about a slightly different approach, basically an optional generated file alongside a recipe which contains "expanded" information which is effectively expensive to generate (in computation or resources like network access/process terms). We could teach bitbake a new phase of parsing where it generates them if missing. There are some other pieces of information which we know during the build process which it would be helpful to know earlier (e.g. which packages a recipe generates). I've wondered about this for a long time and the fetcher issues remind me of it again. It would be a big change with advantages and drawbacks. I think it would put more pressure on a layer maintainer as they'd have to computationally keep this up to date and it would complicate the patch workflow (who should send/regen the files?). I'm putting the idea there, I'm not saying I think we should do it, I'm just considering options.

>> Do you mean like a cache or like the inc files? Is the file totally auto-generated or is manual editing acceptable?

>> = Open questions
>>
>> * Where should we download dependencies?
>> ** Should we use a folder per fetcher (ex. git and npm)?
>> ** Should we use the main folder (ex. crate)?
>> ** Should we translate the name into a folder (ex. gomod)?
>> ** Should we integrate the name into the filename (ex. git)?

>> DL_DIR is meant to be a complete cache of the source so it would need to be downloaded there. Given it maps to the other fetchers, the existing cache mechanisms likely work for these just fine; the open question is whether the lock/spec files should be cached after extraction.

>> You misunderstood the question.
>> It's about the downloadfilename parameter. At the moment some fetchers use a subfolder inside DL_DIR and others use the main folder. It looks like every fetcher has its own concept to handle file collisions between different fetchers. The git and npm fetchers use their own folders, the crate fetcher uses its own .crate file prefix, the gomod fetcher translates the URL into multiple folders and the git fetcher translates the URL into a single folder name.

>> That makes more sense. The layout is partially legacy. The wget and local fetchers were first and hence go directly into DL_DIR. git/svn were separated out into their own directories with a plan to have a directory per fetcher. That didn't always work out with each newer fetcher. Each fetcher does have to handle a unique naming of its urls as only the specific fetcher can know all the url parameters and which ones affect the output vs which ones don't.

>> This doesn't explain why the npm but not the gomod and crate fetchers use a subfolder. All fetchers are based on the wget fetcher.

>> That is probably "my fault". Put yourself in my position. You get a ton of different patches, all touching very varied aspects of the system. When reviewing them you have to try and remember the original design decisions, the future directions, the ways things broke in the past, a desire to try and have clean consistent APIs and so on. I have tried very hard to move things in a direction where things incrementally improve, without unnecessarily blocking new features. It means that things that merge often aren't perfect. We've tried a few different approaches with the newer programming languages and each approach has had pros and cons. The inconsistency is probably as I missed something in review. Sorry :(.

>> Sorry, I don't want to criticize you. I see that you have a lot of work.
>> I want to understand the reasons for the actual design and what it should look like.

>> I only have finite time. There are few people who seem to want to dive in and help with review of patches like these. I did ask some people yesterday; one told me they simply couldn't understand these patches.

>> What can I do to improve the review?

>> I'm doing my best to ask the right questions, try and help others understand them, ensure my own concerns I can identify are resolved, and I don't want to de-motivate you on this work either; I think the idea of improving this is great and I'd love to see it. Equally, I'm also the first person everyone will complain to if we change something and it causes problems for people.

>> So the explanation is probably I just missed something in review at some point. The intent was to separate out the fetcher output going forward (unless it makes sense to be shared).

>> FWIW there are multiple things which bother me about the existing fetcher storage layout but that is a different discussion.

>> Okay.

>> * Where should we unpack the dependencies?
>> ** Should we use a folder inside the parent folder (ex. node_modules)?
>> ** Should we use a fixed folder inside unpackdir (ex. go/pkg/mod/cache/download and cargo_home/bitbake)?

>> This likely depends on the fetcher as the different mechanisms will have different expectations about how they should be extracted (as npm/etc. would).

>> It depends on the fetcher but the fetchers could use the same approach. At the moment every fetcher uses a different approach. The crate fetcher uses a fixed value. The gomod fetcher uses a variable (GO_MOD_CACHE_DIR) and the npm fetcher uses a parameter (destsuffix). Furthermore the gomod fetcher overrides the common subdir parameter.

>> I think we really need to standardise that if we can.
>> Each new fetcher has claimed a certain approach is effectively required by the package manager.

>> What would be your desired solution? Is the variable okay or do you prefer a self-contained SRC_URI?

>> I suspect we need a default via a variable and then the option to change the default via parameters. The default value should be a bitbake-fetcher-namespaced control variable.

>> I'm wary of making a definitive statement saying X if that isn't going to make sense for some backend though. I simply don't have enough knowledge of them all, which is why you see me being reluctant to make definitive statements about design.

>> Okay.

>> * How should we treat archives for package manager caches?
>> ** Should we unpack the archives to support patching (ex. npm)?
>> ** Should we copy the packed archive to avoid unpacking and packaging (ex. gomod)?

>> If there are archives left after do_unpack, which task is going to unpack those? Are we expecting the build process in configure/compile to decompress them? Would those management tools accept things if they were extracted earlier? "unpack" would be the correct time to do it but I can see this getting into conflict with the package manager :/.

>> Most package managers expect archives. In the npm case the archive is unpacked by the fetcher and repacked by the npm.bbclass to support patching. The gomod fetcher doesn't unpack the downloaded archive and the gomodgit fetcher creates archives from git folders during unpack. It would be possible to always keep the archives, or to always extract the archives and recreate them during build. It is a decision between performance and patchability.

>> At the moment it is complicated to work with the different fetchers because every fetcher uses a different concept and it is unclear what the desired approach is.

>> This is a challenge.
>> Can we handle the unpacking with the package manager as a specific step or does it have to be combined with other steps like configure/compile?

>> It looks like this is possible:
>> cargo fetch
>> go mod vendor
>> npm install

>> I suspect you're thinking about using the package manager in do_unpack to unpack the archives and patch the unpacked archives afterwards?

>> I'm wondering about it, yes. I know we've had challenges with patching rust modules for example, so this isn't a theoretical problem :/.

>> It is an interesting idea because most package managers check the integrity before unpack. Additionally it should simplify and speed up the npm build because it removes the repack of the packages. The problem is that we need an additional task to patch the dependency specification file and to unpack the file.

>> I did wonder if patches 1-5 of this series could be merged separately too as they look reasonable regardless of the rest of the series?

>> Sure. Should I resend the patches as a separate series?

>> Yes please, that would then let us remove the bits we can easily review/sort and focus on this other part.

>> Done.

>> Thanks.

>> I will also resend the go h1 checksum commit separately because it could be useful for the gomod fetcher.

>> Yes, I was waiting for a new version of that one with the naming tweaked.

>> Done.

>> Should I also move the dn / dv parameter patches to a separate series because they could be useful without the dependency fetcher? I could add the parameters to the fetchers in a backward-compatible way.

>> I need to think more about that one...

>> The motivation is to include the dependencies with name, version, license and CPE in the SBOM.

>> Regards
>> Stefan
>> View/Reply Online (#16981): https://lists.openembedded.org/g/bitbake-devel/message/16981

> --
> - Thou shalt not follow the NULL pointer, for chaos and madness await thee at its end
> - "Use the force Harry" - Gandalf, Star Trek II
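The "error on floating versions" requirement discussed above could, for the npm case, amount to refusing any lock-file entry that is not fully pinned. A minimal sketch, assuming the package-lock.json v2/v3 `packages` layout; `resolve_npm_dependencies` is a hypothetical helper for illustration, not code from the series:

```python
import json

def resolve_npm_dependencies(lock):
    """Extract pinned (url, integrity) pairs from a parsed
    package-lock.json / npm-shrinkwrap.json (v2/v3 'packages' layout)
    and error out on anything that is not fully pinned."""
    deps = []
    for path, entry in lock.get("packages", {}).items():
        if path == "":           # root package entry, not a dependency
            continue
        if entry.get("link"):    # workspace symlink, nothing to fetch
            continue
        resolved = entry.get("resolved")
        integrity = entry.get("integrity")
        if not resolved:
            raise ValueError(f"{path}: no resolved URL (floating version?)")
        if not integrity and not resolved.startswith("git+"):
            raise ValueError(f"{path}: integrity checksum missing")
        deps.append((resolved, integrity))
    return deps

# Usage: resolve_npm_dependencies(json.load(open("npm-shrinkwrap.json")))
```

Making the check part of the dependency-resolution stage (rather than the language-specific fetcher) would prohibit dynamic versions for all package managers at once, as proposed in the thread.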
On Mon, 6 Jan 2025 at 15:43, Stefan Herbrechtsmeier <stefan.herbrechtsmeier-oss@weidmueller.com> wrote:
> https://github.com/yoctoproject/poky/compare/master...weidmueller:poky:feature/dependency-fetcher
>
> I have migrated the crate recipes to the new fetcher and improved the spdx 2.2 class to include the name and version of the crate dependencies.
>
> You have to inherit the create-spdx-2.2 class and build the librsvg recipe to test the new fetcher.

Thanks, I checked out the branch and ran bitbake -c patch librsvg with the default build/conf/ config. It works and the recipe is short and neat. I'm not sure what create-spdx-2.2 is needed for? I didn't use it, and there were no errors.

Like others, I'm torn on two things:
- visibility
- control

When a recipe explicitly lists what goes into a build, this can be easily seen, audited, and adjusted directly in the recipe. With the new fetchers, you need to actually run a build to produce that list, and it isn't clear where the list is placed, in which format, and what to do if something needs to deviate from the versions prescribed by upstream.

This is not a theoretical concern, I'm thinking specifically of log4j-like vulnerabilities, and how one would check that their product doesn't contain them:
https://lwn.net/Articles/878570/

Alex
On Thu, 9 Jan 2025 at 11:40, Alexander Kanavin via lists.openembedded.org <alex.kanavin=gmail.com@lists.openembedded.org> wrote: > This is not a theoretical concern, I'm thinking specifically of > log4j-like vulnerabilities, and how one would check that their product > doesn't contain them: > https://lwn.net/Articles/878570/ I meant to say 'yocto layer' here, not product. And ideally it should be possible with 'static analysis', e.g. just by looking at the layer content. Alex
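The "static analysis" audit Alex describes, checking a layer for a known-bad dependency version without building anything, only works when the pinned versions sit in files inside the layer. A sketch under that assumption, for npm lock files placed beside recipes; `find_vulnerable` is a hypothetical helper, not part of the series:

```python
import json
from pathlib import Path

def find_vulnerable(layer_dir, package, bad_versions):
    """Scan every npm lock file shipped in a layer for a known-bad
    version of `package` (a log4j-style audit), without running a build."""
    hits = []
    for lockfile in Path(layer_dir).rglob("*.json"):
        if lockfile.name not in ("package-lock.json", "npm-shrinkwrap.json"):
            continue
        lock = json.loads(lockfile.read_text())
        for path, entry in lock.get("packages", {}).items():
            # v2/v3 lock files key entries by their node_modules path
            if path.endswith("node_modules/" + package) \
               and entry.get("version") in bad_versions:
                hits.append((str(lockfile), entry["version"]))
    return hits
```

With the pure fetcher-based approach the same audit would need the resolved dependency list to be written somewhere static, which is exactly the visibility concern raised above.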
Am 08.01.2025 um 16:43 schrieb Bruce Ashfield: > > On Tue, Jan 7, 2025 at 12:46 PM Stefan Herbrechtsmeier > <stefan.herbrechtsmeier-oss@weidmueller.com> wrote: > > Am 07.01.2025 um 17:58 schrieb Bruce Ashfield: >> Hi all, >> >> I'm going to reply at this point in the thread to at least let >> everyone know that I've been reading along, but honestly can't >> say if a few questions that I have have been asked (and answered). >> >> The biggest use case that I have for the layers and recipes that >> I maintain is about being able to both "easily" patch or update >> vendor/dependencies of the main application build. >> >> It was unclear to me how I'd do that with these changes. >> >> For the copied/extracted dependencies, I can see that you'd just >> be able to figure out where they were extracted (and I see the >> discussions on where to extract/store some of the files) and then >> write a patch as you would with any recipe. But would there be a >> way to patch the dependency "lock file" ? I definitely don't see >> a way that I'd be able to tweak a source hash and have an updated >> dependency pulled in .. but I could have easily missed that. > > You have to provide your own "lock file" and place it beside the > recipe. The "lock file" is fetched via the file fetcher and is > used to fetch the dependencies. > > My requirement would be to individually bump the vendored > dependencies. A copy and update of just a single entry in the lock > file is possible, which is what I'd do. I'm just pointing out that > finer grained control is required when quickly iterating or developing > packages. > > I find a lot of mindshare goes towards just building and creating > images, where there's also a need to support development workflows. That's the reason I use the package manager specific lock file as base. Every package manager has tools and workflows to manage, update or override the dependencies. 
These tools not only update the source URL and checksum but also handle the influence on other dependencies, sub-dependencies and version selection. It is much easier to update the lock file with the existing tooling and pass the updated lock file (or a patch) to bitbake.

>> Those are the primary reasons why I'll stay with explicitly listed / visible dependencies, unless something similar is available in a re-worked / unified fetcher.

> It is impossible to patch the sources inside bitbake. Therefore the dependency resolution must be moved inside a dependency fetch task and an additional dependency patch task needs to be added.

> I'm just talking about being able to patch the vendor sources once they are fetched and placed in their build location. Using normal patch files on the SRC_URI. When the location of the vendor source isn't obvious (because it is calculated or dynamically generated), this becomes more challenging.

This should be possible if we use vendoring and create the vendor folder before do_patch:

do_fetch
do_unpack
do_vendor
do_patch

We could use a do_update task to parse the lock file and update an inc file. Or we could add additional tasks to resolve additional fetcher URLs from the spec / lock file:

do_vendor_spec_fetch
do_vendor_spec_unpack
do_vendor_spec_patch
do_vendor_fetch
do_fetch
do_unpack
do_vendor
do_patch

This sequence ensures that do_fetch still downloads all dependencies. Only do_vendor_fetch needs internet access.

>> I prefer the translation to git, so I have debug source for vendor dependencies as well as a well-travelled path to mirror and archive the source

> Are you referring to the go-vendor implementation? Do you mean the vendor directory? The gomod fetcher should support mirroring and archiving the sources. It should be possible to create a vendor folder from the gomod archives.

> Nope. I don't use that either.
I have my own tools to locate the > source of the dependencies, clone and put them into a vendor > directory. The recipe simply clones and copies using git after that. > >> , but something like the update task of rust is at least explicit >> and visible to me, so I can also use it without too many issues. > > Do you mean `bitbake -c update_crates recipe-name`? > > > Correct. The .inc file updating mechanisms. Okay. Regards Stefan >> On Tue, Jan 7, 2025 at 11:13 AM Stefan Herbrechtsmeier via >> lists.openembedded.org <http://lists.openembedded.org> >> <stefan.herbrechtsmeier-oss=weidmueller.com@lists.openembedded.org> >> wrote: >> >> Am 07.01.2025 um 12:01 schrieb Richard Purdie: >>> On Tue, 2025-01-07 at 10:47 +0100, Stefan Herbrechtsmeier wrote: >>>> Am 06.01.2025 um 16:30 schrieb Richard Purdie: >>>>> On Mon, 2025-01-06 at 15:35 +0100, Stefan Herbrechtsmeier wrote: >>>>>>> I'm a little bit worried about how easily you could sneak a >>>>>>> "floating" version into this and make the fetcher non- >>>>>>> deterministic. Does (or could?) the code detect and error on >>>>>>> that? >>>>>>> >>>>>> We could raise an error if a checksum is missing in the >>>>>> dependency specification file or make the checksum mandatory for >>>>>> the dependency fetcher. Furthermore we could inspect the >>>>>> dependency URLs to detect a misuse of the file like a latest >>>>>> string for the version. >>>>>> >>>>> I think adding such an error would be a requirement for merging >>>>> this. >>>>> >>>> Should the dependency fetcher (ex. npmsw) or the language specific >>>> fetcher (ex. npm) fail if the version points to a latest version? >>> I think right now it has to error to try and reduce complexity. It is >>> possible to support such things but you have to pass that version >>> information back up the stack so that PV represents the different >>> versions and that is a new level of complexity. 
>>> >>> I guess we should consider how you could theoretically support it as >>> that might influence the design. With multiple git repos in SRC_URI for >>> example, we end up adding multiple shortened shas to construct a PV so >>> that if any change, PV changes. We also have to add an incrementing >>> integer so that on opkg/dpkg/rpm operations work and versions sort. >> >> Okay. In this case we should add the checks to the dependency >> resolution. Thereby we prohibit dynamic versions for the >> dependencies and allows users to add support for it to the >> fetcher of the package manager. >> >>>>>>> Put another way, could one of these SRC_URIs map to multiple >>>>>>> different combinations of underlying component versions? >>>>>> If you mean the extracted SRC_URI for a single dependency from >>>>>> the dependency specification file (ex. npm-shrinkwrap.json) it >>>>>> could use special URLs to map to the latest version. But this is >>>>>> a missus of the dependency specification file and could be >>>>>> detected. The tools generate files with fixed versions always >>>>>> because a floating version with a fixed checksum make no senses. >>>>> Even if it shouldn't happen, we need to detect and error for this >>>>> case as it would become very problematic for us. >>>>> >>>> Okay. Should we disallow a dynamic version for package manager >>>> downloads generally or do you see a reasonable use case? >>> See above. >>> >>>>>> I also thought it would make sense to generate recipes from the >>>>>> dependency specification files and therefore worked on the >>>>>> recipetool >>>>>> previous. But it looks like the tool isn't really used and I'm >>>>>> afraid >>>>>> nobody will use the recipe to fix dependencies. In most cases it >>>>>> is >>>>>> easy to update a dependency in the native tooling and only >>>>>> provide an >>>>>> updated dependency specification file. 
>>>>>> >>>>> >>>>> I think people have wanted a single simple command to translate the >>>>> specification file into our recipe format to update the recipe. For >>>>> various reasons people didn't seem to find the recipetool approach >>>>> was working and created the task workflow based one. There are pros >>>>> and cons to both and I don't have a strong preference. I would like >>>>> to see something which makes it clear to users what is going on >>>>> though and is simple to use. >>>>> >>>>> People do intuitively understand a .inc file with a list of urls in >>>>> it. There are challenges in updating it. >>>>> >>>>> This other approach is not as intuitive as everything is abstracted >>>>> out of sight. >>>>> >>>>> One thing for example which worries me is how are the license >>>>> fields in the recipe going to be updated? >>>>> >>>>> Currently, if we teach the class, it can set LICENSE variables >>>>> appropriately. With the new approach, you don't know the licenses >>>>> until >>>>> after unpack has run. Yes it can write it into the SPDX, but it >>>>> won't >>>>> work for something like the layer index or forms of analysis which >>>>> don't build things. >>>>> >>>>> This does also extend to vulnerability analysis since we can't know >>>>> what is in a given recipe without actually unpacking it. For >>>>> example we >>>>> could know crate XXX at version YYY has a CVE but we can't tell if >>>>> a >>>>> recipe uses that crate until after do_unpack, or at least not >>>>> without >>>>> expandurl. >>>>> >>>> >>>> The main question is if the meta data should contain all information. >>>> If yes, we shouldn't allow any fetcher which requires an external >>>> source. This should include the gitsm fetcher and we should replace >>>> the single SRC_URI with multiple git SRC_URIs. >>> If we had tooling that supported that well we could certainly consider >>> it. 
It isn't straight forward as you can have a git repo containing >>> submodules which then themselves contain submodules which can then >>> contain more levels of submodules. There are therefore multiple levels >>> of expansion possible. >> >> Okay. That makes the git submodule special in compare to the >> other dependency fetcher. >> >>>> We can go even further and forbid specific package manager fetchers >>>> and use plain https or git SRC_URIs. The python and go-vendor fetcher >>>> use this approach. >>>> >>>> Alternative we allow dependency fetchers and require that the meta >>>> data be always used via bitbake. In this case we could extend the >>>> meta data via the fetcher. >>>> >>>> In both cases it is possible to produce the same meta data. It >>>> doesn't matter if we use recipetool, devtool, bbclasses or fetcher. >>>> In any case we could resolve the SRC_URIs, checksums or srcrev from a >>>> file. The license information could be fetched from the package >>>> repositories without integrity checks or could be extracted from the >>>> individual package description file inside the downloaded sources >>>> (ex. npm). We should skip the license detection from license files >>>> for now because they generate manual work and could be discuses >>>> later. >>> That was the reason the current task based approach doesn't use them, >>> yet! I mention it just to highlight that it can be solved either way, >>> the approach doesn't really change what we need to do. The bigger >>> concern is having information available in the metadata which I think >>> we need do to some level regardless of which approach we choose. >>> >>>> The recipe approach has the advantage that it uses fixed licenses and >>>> that license changes could be (theoretical) reviewed during recipe >>>> update. >>> FWIW that is an important use case and one of our general strengths. We >>> can only do that as the license information is written in recipes and >>> can be compared at update time. 
>>
>> Does this apply to the license of every individual dependency or only to the combined license?
>>
>>>> In contrast the fetcher approach reduces the update procedure to a simple file rename or SRCREV update (ex. gitsm). Furthermore, the user could simply place a file beside the recipe to update the dependencies. Could we realize the same via devtool integration and a patch?
>>> This is effectively what the task based approach is aiming for currently. I think the idea was that we could have devtool/recipetool integration around that update task; a task was just a convenient way to capture the code to do it and get things working without needing the tool to be finished.
>> What is the task based approach? `bitbake -c update xyz`?
>>
>>>> We have different solutions between the languages (ex. npmsw vs crate vs pypi) and even inside the languages (ex. go-vendor vs gomod). I would like to unify the dependency support. It doesn't matter if we decide to use the bitbake fetcher or a bitbake / devtool command for the dependency and license resolution.
>>> I do very much prefer having one good way of doing things rather than multiple ways of doing things, each with a potential drawback. I'm therefore broadly in favour of doing that, as long as we don't upset too much existing mindshare along the way.
>>
>> Okay
>>
>>>>>>
>>>>>> I have a WIP to integrate the dependencies into the SPDX. This uses the expanded_urldata / implicit_urldata function to add the dependencies to the process list of archiver and spdx.
>>>>>>
>>>>>> https://github.com/weidmueller/poky/tree/feature/dependency-fetcher
>>>>>>
>>>>>> Regarding the license, we could migrate the functionality from recipetool into a class and detect the licenses at build time.
>>>>>> Theoretically the fetcher could fetch the license from the package manager repository, but we have to trust the repository because we have no checksum to detect changes. Maybe we could integrate tools like Syft or ScanCode to detect the licenses at build time. At the moment the best solution is to make sure that the SBOM contains the name and version of the dependencies and let other tools handle the license via the SBOM for now. Therefore I propose a common scheme to define the dependency name (dn) and version (dv) in the SRC_URI.
>>>>>
>>>>> We could compare what licenses the package manager is showing us with what is in the recipe, and error out if different. There would then need to be a command to update the licenses in the recipe (in much the way urls currently get updated).
>>>>
>>>> Either we request the licenses from the package manager during package update or during fetch; I wouldn't do both. Instead I would analyze the license file during build and compare the detected license with the recipe or fetcher generated licenses. But the license detection from files is another topic and I would like to postpone it for now.
>>> Agreed, I mention it just to highlight that supporting them does have an impact on the design, so any solution needs to ultimately be able to support it.
>>>
>>>>>>> You're using DL_DIR for that which I suspect isn't a great idea for tmp files.
>>>>>> Taken over from gitsm.
>>>>> Probably not the best fetcher and I'd say gitsm should be fixed.
>>>> I don't see a reason why the gitsm fetcher shouldn't be handled like the other dependency fetchers. We could update the handler after we have a decision for the dependency fetchers.
>>> In principle perhaps, but as mentioned above, gitsm has its own challenges.
>>
>> Based on your feedback I have the feeling that a dependency fetcher isn't the correct solution. The fetcher makes it impossible to review changes during a recipe update. Additionally it needs caching for the resolved fetch and license data.
>>
>> The alternative is to create an inc file with SRC_URIs, checksums, SRCREVs and LICENSE. Do you have any recommendation on how to integrate the dependency resolution and inc file creation into oe-core?
>>
>>>>>>> The url scheme is clever but also has a potential risk in that you can't really pass parameters to both the top level fetcher and the underlying one. I'm worried that is going to bite us further down the line.
>>>>>> At the moment I don't see a real problem, but maybe you are right. The existing language-specific fetchers use fixed paths for their downloads.
>>>>>>
>>>>>> What do you propose? Should the fetcher skip the unpack of the source, or should we introduce a sub fetcher which uses the download from another SRC_URI entry? The two entries could be linked via the name parameter. This approach could be combined with your suggestion above. The new fetcher would unpack a lock file from another (default) download.
>>>>>
>>>>> I'm not really sure what is best right now. I'm trying to spell out the pros/cons of what is going on here in the hope it encourages others to give feedback as well. I agree there isn't a problem right now, but I worry there soon will be by mixing two things together like this. The way we handle the git protocol does cause us friction with other url schemes already.
>>>> The dependency fetcher could simply skip the unpack. In this case the user needs to use a variable to pass the same URL to the git and dependency fetcher, or we could provide a python function to generate two SRC_URI entries with the same base URL.
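[Editor's sketch] The idea of deriving two linked SRC_URI entries from one base URL, tied together via the name parameter, could look roughly as follows. This is illustrative only: the helper name, the cargolock:// scheme and the srcname parameter are assumptions, not part of the series.

```python
# Sketch: derive a pair of SRC_URI entries from one base git URL.
# The first entry fetches the sources; the second tells a (hypothetical)
# dependency fetcher to resolve the lock file from that download.
# Linking both entries via the same name is the idea discussed above;
# none of these exact parameter names are final.

def linked_src_uris(base_url, name, lockfile="Cargo.lock"):
    """Return (source_uri, dependency_uri) sharing one base URL."""
    source = f"{base_url};protocol=https;name={name}"
    # The dependency entry skips its own download and refers to the
    # source entry via the shared name parameter.
    deps = f"cargolock://{lockfile};srcname={name}"
    return source, deps

src, deps = linked_src_uris("git://example.com/foo.git", "foo")
```

A recipe would then list both returned entries in SRC_URI, avoiding the duplicated base URL in a variable.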
>>>>
>>> I'm starting to wonder about a slightly different approach: basically an optional generated file alongside a recipe which contains "expanded" information that is effectively expensive to generate (in computation, or in resource terms like network access/processing). We could teach bitbake a new phase of parsing where it generates them if missing. There are some other pieces of information which we know during the build process which it would be helpful to know earlier (e.g. which packages a recipe generates). I've wondered about this for a long time and the fetcher issues remind me of it again. It would be a big change with advantages and drawbacks. I think it would put more pressure on a layer maintainer, as they'd have to computationally keep this up to date, and it would complicate the patch workflow (who should send/regen the files?). I'm putting the idea out there; I'm not saying I think we should do it, I'm just considering options.
>>
>> Do you mean like a cache or like the inc files? Is the file totally auto-generated or is manual editing acceptable?
>>
>>>> = Open questions
>>>>>>>> * Where should we download dependencies?
>>>>>>>> ** Should we use a folder per fetcher (ex. git and npm)?
>>>>>>>> ** Should we use the main folder (ex. crate)?
>>>>>>>> ** Should we translate the name into a folder (ex. gomod)?
>>>>>>>> ** Should we integrate the name into the filename (ex. git)?
>>>>>>>>
>>>>>>>
>>>>>>> DL_DIR is meant to be a complete cache of the source, so it would need to be downloaded there. Given it maps to the other fetchers, the existing cache mechanisms likely work for these just fine; the open question is whether the lock/spec files should be cached after extraction.
>>>>>>
>>>>>> You misunderstand the question. It's about the downloadfilename parameter.
At the moment some fetchers use a sub folder inside DL_DIR >>>>>> and others use the main folder. It looks like every fetcher has its own concept to handle file collisions between different fetchers. The git and npm fetchers use their own folders, the crate fetcher uses its own .crate file prefix, the gomod fetcher translates the URL into multiple folders, and the git fetcher translates the URL into a single folder name.
>>>>> That makes more sense. The layout is partially legacy. The wget and local fetchers were first and hence go directly into DL_DIR. git/svn were separated out into their own directories with a plan to have a directory per fetcher. That didn't always work out with each newer fetcher. Each fetcher does have to handle a unique naming of its urls, as only the specific fetcher can know all of a url's parameters and which ones affect the output vs which ones don't.
>>>>>
>>>> This doesn't explain why the npm fetcher, but not the gomod and crate fetchers, uses a sub folder. All fetchers are based on the wget fetcher.
>>> That is probably "my fault". Put yourself in my position. You get a ton of different patches, all touching very varied aspects of the system. When reviewing them you have to try and remember the original design decisions, the future directions, the ways things broke in the past, a desire to try and have clean consistent APIs, and so on. I have tried very hard to move things in a direction where things incrementally improve, without unnecessarily blocking new features. It means that things that merge often aren't perfect. We've tried a few different approaches with the newer programming languages and each approach has had pros and cons. The inconsistency is probably because I missed something in review. Sorry :(.
>>
>> Sorry, I don't want to criticize you. I see that you have a lot of work.
I want to understand the reasons for the actual design and how it should look.
>>
>>> I only have finite time. There are few people who seem to want to dive in and help with the review of patches like these. I did ask some people yesterday; one told me they simply couldn't understand these patches.
>>
>> What can I do to improve the review?
>>
>>> I'm doing my best to ask the right questions, try to help others understand the patches, and ensure the concerns I can identify are resolved. I don't want to de-motivate you on this work either; I think the idea of improving this is great and I'd love to see it. Equally, I'm also the first person everyone will complain to if we change something and it causes problems for people.
>>>
>>> So the explanation is probably that I just missed something in review at some point. The intent was to separate out the fetcher output going forward (unless it makes sense to be shared).
>>>
>>> FWIW there are multiple things which bother me about the existing fetcher storage layout, but that is a different discussion.
>>
>> Okay.
>>
>>>>>>>> * Where should we unpack the dependencies?
>>>>>>>> ** Should we use a folder inside the parent folder (ex. node_modules)?
>>>>>>>> ** Should we use a fixed folder inside unpackdir (ex. go/pkg/mod/cache/download and cargo_home/bitbake)?
>>>>>>>
>>>>>>> This likely depends on the fetcher, as the different mechanisms will have different expectations about how they should be extracted (as npm/etc. would).
>>>>>>
>>>>>> It depends on the fetcher, but the fetchers could use the same approach. At the moment every fetcher uses a different approach. The crate fetcher uses a fixed value, the gomod fetcher uses a variable (GO_MOD_CACHE_DIR) and the npm fetcher uses a parameter (destsuffix). Furthermore the gomod fetcher overrides the common subdir parameter.
>>>>> I think we really need to standardise that if we can.
Each new fetcher has claimed a certain approach is effectively required by the package manager.
>>>> What would be your desired solution? Is the variable okay, or do you prefer a self-contained SRC_URI?
>>> I suspect we need a default via a variable and then the option to change the default via parameters. The default value should be a bitbake fetcher namespaced control variable.
>>>
>>> I'm wary of making a definitive statement saying X if that isn't going to make sense for some backend though. I simply don't have enough knowledge of them all, which is why you see me being reluctant to make definitive statements about design.
>>
>> Okay.
>>
>>>>>>>> * How should we treat archives for package manager caches?
>>>>>>>> ** Should we unpack the archives to support patching (ex. npm)?
>>>>>>>> ** Should we copy the packed archive to avoid unpacking and packaging (ex. gomod)?
>>>>>>>>
>>>>>>> If there are archives left after do_unpack, which task is going to unpack those? Are we expecting the build process in configure/compile to decompress them? Would those management tools accept things if they were extracted earlier? "unpack" would be the correct time to do it, but I can see this getting into conflict with the package manager :/.
>>>>>>
>>>>>> Most package managers expect archives. In the npm case the archive is unpacked by the fetcher and repacked by the npm.bbclass to support patching. The gomod fetcher doesn't unpack the downloaded archive, and the gomodgit fetcher creates archives from git folders during unpack. It would be possible to always keep the archives, or to always extract the archives and recreate them during build. It is a decision between performance and patchability.
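[Editor's sketch] The "default via a namespaced variable, overridable via a URL parameter" design suggested above might resolve roughly like this. The variable naming scheme and the use of destsuffix as the override are assumptions for illustration; bitbake's datastore is stood in for by a plain dict.

```python
# Sketch: resolve the unpack destination for a fetcher named `scheme`.
# Precedence: explicit destsuffix URL parameter, then a per-fetcher
# namespaced variable, then a generic per-fetcher fallback directory.
# All variable names here are hypothetical.

def resolve_unpack_dir(scheme, url_params, datastore):
    if "destsuffix" in url_params:                  # per-URL override
        return url_params["destsuffix"]
    value = datastore.get(f"FETCHER_UNPACKDIR_{scheme.upper()}")
    if value:                                       # namespaced default
        return value
    return scheme                                   # fallback: one dir per fetcher

d = {"FETCHER_UNPACKDIR_GOMOD": "go/pkg/mod/cache/download"}
assert resolve_unpack_dir("gomod", {}, d) == "go/pkg/mod/cache/download"
assert resolve_unpack_dir("gomod", {"destsuffix": "vendor"}, d) == "vendor"
assert resolve_unpack_dir("npm", {}, d) == "npm"
```

This keeps recipes short in the common case while still allowing a per-URL deviation, which matches how destsuffix already behaves for the npm fetcher.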
>>>>>>
>>>>>> At the moment it is complicated to work with the different fetchers, because every fetcher uses a different concept and it is unclear what the desired approach is.
>>>>>
>>>>> This is a challenge. Can we handle the unpacking with the package manager as a specific step, or does it have to be combined with other steps like configure/compile?
>>>>>
>>>> It looks like this is possible:
>>>> cargo fetch
>>>> go mod vendor
>>>> npm install
>>>>
>>>> I suspect you're thinking about using the package manager in do_unpack to unpack the archives and patch the unpacked archives afterwards?
>>> I'm wondering about it, yes. I know we've had challenges with patching rust modules, for example, so this isn't a theoretical problem :/.
>>
>> It is an interesting idea, because most package managers check the integrity before unpack. Additionally it should simplify and speed up the npm build because it removes the repacking of the packages. The problem is that we need an additional task to patch the dependency specification file and to unpack the file.
>>
>>>>>>> I did wonder if patches 1-5 of this series could be merged separately too, as they look reasonable regardless of the rest of the series?
>>>>>>
>>>>>> Sure. Should I resend the patches as a separate series?
>>>>> Yes please, that would then let us remove the bits we can easily review/sort and focus on this other part.
>>>>>
>>>> Done.
>>> Thanks.
>>>
>>>> I will also resend the go h1 checksum commit separately, because it could be useful for the gomod fetcher.
>>> Yes, I was waiting for a new version of that one with the naming tweaked.
>>
>> Done.
>>
>>>> Should I also move the dn / dv parameter patches to a separate series, because they could be useful without the dependency fetcher? I could add the parameters to the fetchers in a backward compatible way.
>>> I need to think more about that one...
>>
>> The motivation is to include the dependencies with name, version, license and CPE in the SBOM.
>>
>> Regards
>> Stefan
>>
>> -=-=-=-=-=-=-=-=-=-=-=-
>> Links: You receive all messages sent to this group.
>> View/Reply Online (#16981): https://lists.openembedded.org/g/bitbake-devel/message/16981
>> Mute This Topic: https://lists.openembedded.org/mt/110212697/1050810
>> Group Owner: bitbake-devel+owner@lists.openembedded.org
>> Unsubscribe: https://lists.openembedded.org/g/bitbake-devel/unsub [bruce.ashfield@gmail.com]
>> -=-=-=-=-=-=-=-=-=-=-=-
>
> --
> - Thou shalt not follow the NULL pointer, for chaos and madness await thee at its end
> - "Use the force Harry" - Gandalf, Star Trek II
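[Editor's sketch] The proposed dn/dv scheme would let SBOM tooling recover a dependency's name and version from a SRC_URI entry alone. A minimal parser could look like this; the exact ";key=value" placement of dn/dv is my assumption based on the description above.

```python
# Sketch: extract the proposed dependency name (dn) and version (dv)
# parameters from a SRC_URI entry for SBOM generation. Parameter
# splitting follows bitbake's usual ";key=value" URL syntax.

def dependency_id(src_uri_entry):
    """Return (name, version) if the entry carries dn/dv, else None."""
    parts = src_uri_entry.split(";")
    params = dict(p.split("=", 1) for p in parts[1:] if "=" in p)
    if "dn" in params and "dv" in params:
        return params["dn"], params["dv"]
    return None

uri = "https://crates.io/api/v1/crates/glob/0.3.1/download;dn=glob;dv=0.3.1"
```

An SBOM class could then iterate over all SRC_URI entries and emit one package record per non-None result.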
Hi, thanks for looking into this.

With this series applied I've noticed some recipes now showing warnings like:

WARNING: enact-dev-native-6.1.3-r0 do_unpack: Please add support for the url to npm fetcher: https://registry.npmjs.org/string-width/-/string-width-4.2.3.tgz
WARNING: enact-dev-native-6.1.3-r0 do_unpack: Please add support for the url to npm fetcher: https://registry.npmjs.org/strip-ansi/-/strip-ansi-6.0.1.tgz
WARNING: enact-dev-native-6.1.3-r0 do_unpack: Please add support for the url to npm fetcher: https://registry.npmjs.org/wrap-ansi/-/wrap-ansi-7.0.0.tgz

The same warnings are shown earlier, from do_fetch.

Not sure what's special about these, but I believe it used to work with the previous npmsw implementation. Any hint what to check?

On Fri, Dec 20, 2024 at 12:26 PM Stefan Herbrechtsmeier via lists.openembedded.org <stefan.herbrechtsmeier-oss=weidmueller.com@lists.openembedded.org> wrote:
>
> From: Stefan Herbrechtsmeier <stefan.herbrechtsmeier@weidmueller.com>
>
> The patch series improves the fetcher support for tightly coupled package manager (npm, go and cargo). It adds support for embedded dependency fetcher via a common dependency mixin. The patch series reworks the npm-shrinkwrap.json (package-lock.json) support and adds a fetcher for go.sum and cargo.lock files. The dependency mixin contains two stages. The first stage locates a local specification file or fetches an archive or git repository with a specification file. The second stage resolves the dependency URLs from the specification file and fetches the dependencies.
>
> SRC_URI = "<type>://npm-shrinkwrap.json"
> SRC_URI = "<type>+http://example.com/ npm-shrinkwrap.json"
> SRC_URI = "<type>+http://example.com/${BP}.tar.gz;striplevel=1;subdir=${BP}"
> SRC_URI = "<type>+git://example.com/${BPN}.git;protocol=https"
>
> Additionally, the patch series reworks the npm fetcher to work without a npm binary and external package repository.
It adds support for a common > dependency name and version schema to integrate the dependencies into > the SBOM.
>
> = Background
> Bitbake has diverse concepts and drawbacks for the different tightly coupled package managers. The Python support uses a recipe per dependency and generates common fetcher URLs via a python function. The other languages embed the dependencies inside the recipe. The Node.js support offers a npmsw fetcher which uses a lock file beside the recipe to generate multiple common fetcher URLs on the fly, and thereby hides the real download sources. This leads, for example, to a single source in the SBOM. The Go support contains two parallel implementations: a vendor-based solution with a common fetcher and a go-mod-based solution with a gomod fetcher. The vendor-based solution includes the individual dependencies in the SRC_URI of the recipe and uses a python function to generate common fetcher URLs with additional information for the vendor task. The gomod fetcher uses a proprietary gomod URL. It translates the URL into a common URL and prepares metadata during unpack. The Rust support includes the individual dependencies in the SRC_URI of the recipe and uses proprietary crate URLs. The crate fetcher translates a proprietary URL into a common fetcher URL and prepares metadata during unpack. The recipetool does not support the crate and the gomod fetcher. This leads to missing licenses of the dependencies in the recipe, for example librsvg.
>
> The steps needed to fetch dependencies for Node.js, Go and Rust are similar:
> 1. Extract the dependencies from a specification file (name, version, checksum and URL)
> 2. Generate proprietary fetcher URIs
> a. npm://registry.npmjs.org/;package=glob;version=10.3.15
> b. gomod://golang.org/x/net;version=v0.9.0
> gomodgit://golang.org/x/net;version=v0.9.0;repo=go.googlesource.com/net
> c. crate://crates.io/glob/0.3.1
> 3. Generate wget or git fetcher URIs
> a.
https://registry.npmjs.org/glob/-/glob-10.3.15.tgz;downloadfilename=…
> b. https://proxy.golang.org/golang.org/x/net/@v/v0.9.0.zip;downloadfilename=…
> git://go.googlesource.com/net;protocol=https;subdir=…
> c. https://crates.io/api/v1/crates/glob/0.3.1/download;downloadfilename=…
> 4. Unpack
> 5. Create meta files
> a. Update lockfile and create tar.gz archives
> b. Create go.mod file
> Create info, go.mod file and zip archives
> c. Create .cargo-checksum.json files
>
> It looks like the recipetool is not widely used, and therefore this patch series integrates the dependency resolution into the fetcher. After an agreement on a concept the fetcher could be extended. The fetcher could download the license information per package, and a new build task could run the license cruncher from the recipetool.
>
> = Open questions
>
> * Where should we download dependencies?
> ** Should we use a folder per fetcher (ex. git and npm)?
> ** Should we use the main folder (ex. crate)?
> ** Should we translate the name into a folder (ex. gomod)?
> ** Should we integrate the name into the filename (ex. git)?
> * Where should we unpack the dependencies?
> ** Should we use a folder inside the parent folder (ex. node_modules)?
> ** Should we use a fixed folder inside unpackdir (ex. go/pkg/mod/cache/download and cargo_home/bitbake)?
> * How should we treat archives for package manager caches?
> ** Should we unpack the archives to support patching (ex. npm)?
> ** Should we copy the packed archive to avoid unpacking and packaging (ex. gomod)?
>
> This patch series depends on patch series 20241209103158.20833-1-stefan.herbrechtsmeier-oss@weidmueller.com ("[1/4] tests: fetch: adapt npmsw tests to fixed unpack behavior").
> > > Stefan Herbrechtsmeier (21): > tests: fetch: update npmsw tests to new lockfile format > fetch2: npmsw: remove old lockfile format support > tests: fetch: replace [url] with urls for npm > fetch2: do not prefix embedded checksums > fetch2: read checksum from SRC_URI flag for npm > fetch2: introduce common package manager metadata > fetch2: add unpack support for npm archives > utils: add Go mod h1 checksum support > fetch2: add destdir to FetchData > fetch: npm: rework > tests: fetch: adapt style in npm(sw) class > tests: fetch: move npmsw test cases into npmsw test class > tests: fetch: adapt npm test cases > fetch: add dependency mixin > tests: fetch: add test cases for dependency fetcher > fetch: npmsw: migrate to dependency mixin > tests: fetch: adapt npmsw test cases > fetch: add gosum fetcher > tests: fetch: add test cases for gosum > fetch: add cargolock fetcher > tests: fetch: add test cases for cargolock > > lib/bb/fetch2/__init__.py | 35 +- > lib/bb/fetch2/cargolock.py | 73 +++ > lib/bb/fetch2/dependency.py | 167 +++++++ > lib/bb/fetch2/gomod.py | 5 +- > lib/bb/fetch2/gosum.py | 51 +++ > lib/bb/fetch2/npm.py | 244 +++------- > lib/bb/fetch2/npmsw.py | 347 ++++---------- > lib/bb/tests/fetch.py | 880 +++++++++++++++++------------------- > lib/bb/utils.py | 25 + > 9 files changed, 916 insertions(+), 911 deletions(-) > create mode 100644 lib/bb/fetch2/cargolock.py > create mode 100644 lib/bb/fetch2/dependency.py > create mode 100644 lib/bb/fetch2/gosum.py > > -- > 2.39.5 > > > -=-=-=-=-=-=-=-=-=-=-=- > Links: You receive all messages sent to this group. > View/Reply Online (#16920): https://lists.openembedded.org/g/bitbake-devel/message/16920 > Mute This Topic: https://lists.openembedded.org/mt/110212697/3617156 > Group Owner: bitbake-devel+owner@lists.openembedded.org > Unsubscribe: https://lists.openembedded.org/g/bitbake-devel/unsub [martin.jansa@gmail.com] > -=-=-=-=-=-=-=-=-=-=-=- >
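[Editor's sketch] Steps 2 and 3 of the cover letter map a proprietary fetcher URI onto a plain download URL. For the crate case this mapping (crate://crates.io/glob/0.3.1 becoming the crates.io download URL) can be sketched as a pure function; the helper name is illustrative and only the happy path is handled.

```python
# Sketch: translate a proprietary crate:// URI into the plain https
# download URL, as described in steps 2 -> 3 of the cover letter.
# The elided ;downloadfilename=… part of the real URL is left out here.

def crate_to_https(crate_uri):
    """crate://<host>/<name>/<version> -> https download URL."""
    assert crate_uri.startswith("crate://")
    host, name, version = crate_uri[len("crate://"):].rstrip("/").split("/")
    return f"https://{host}/api/v1/crates/{name}/{version}/download"
```

Keeping such translations as small pure functions would make them easy to unit-test in lib/bb/tests/fetch.py, independent of any network access.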
On 09.01.2025 at 11:40, Alexander Kanavin wrote:
> On Mon, 6 Jan 2025 at 15:43, Stefan Herbrechtsmeier <stefan.herbrechtsmeier-oss@weidmueller.com> wrote:
>> https://github.com/yoctoproject/poky/compare/master...weidmueller:poky:feature/dependency-fetcher
>>
>> I have migrated the crate recipes to the new fetcher and improved the spdx 2.2 class to include the name and version of the crate dependencies.
>>
>> You have to inherit the create-spdx-2.2 class and build the librsvg recipe to test the new fetcher.
> Thanks, I checked out the branch and ran bitbake -c patch librsvg with the default build/conf/ config. It works and the recipe is short and neat.

Thanks for your test.

> I'm not sure what create-spdx-2.2 is needed for? I didn't use it, and there were no errors.

The change is needed to add the dependencies and their names and versions to the SBOM.

> Like others, I'm torn on two things:
> - visibility
> - control
>
> When a recipe explicitly lists what goes into a build, this can be easily seen, audited, and adjusted directly in the recipe. With the new fetchers, you need to actually run a build to produce that list, and it isn't clear where the list is placed, in which format, and what to do if something needs to deviate from versions prescribed by upstream.

I missed the appropriate function in the dependency mixin in this series. The list is created on demand (see the archiver or spdx patch). Every deviation needs to be handled in a package manager lock file. Therefore you could place a lock file beside the recipe. You could use an editor or the language-specific tools to manipulate the lock file.

> This is not a theoretical concern, I'm thinking specifically of log4j-like vulnerabilities, and how one would check that their product doesn't contain them:
> https://lwn.net/Articles/878570/

Do you have any tools to check it at the moment? I proposed a common style for the package name and version parameter of a package manager fetch URI (ex.
crate). The information can be included in the SBOM and used outside of bitbake. As a follow-up, we could use the information to create a CPE and add the dependencies to the CVE check.
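[Editor's sketch] The follow-up idea of deriving a CPE from the dependency name and version could be sketched as below. Using the name as the vendor field is a naive placeholder of mine; real CPE matching requires a dictionary lookup against the NVD.

```python
# Sketch: build a CPE 2.3 formatted string from a dependency name and
# version, e.g. for cve-check integration. The vendor defaulting to the
# package name is a simplification; real CPEs come from the NVD dictionary.

def cpe23(name, version, vendor=None):
    vendor = vendor or name
    return f"cpe:2.3:a:{vendor}:{name}:{version}:*:*:*:*:*:*:*"
```

For example, cpe23("glob", "0.3.1") yields a candidate identifier that a CVE scanner could match against its database.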
On 09.01.2025 at 11:50, Alexander Kanavin wrote:
> On Thu, 9 Jan 2025 at 11:40, Alexander Kanavin via lists.openembedded.org <alex.kanavin=gmail.com@lists.openembedded.org> wrote:
>> This is not a theoretical concern, I'm thinking specifically of log4j-like vulnerabilities, and how one would check that their product doesn't contain them:
>> https://lwn.net/Articles/878570/
> I meant to say 'yocto layer' here, not product. And ideally it should be possible with 'static analysis', e.g. just by looking at the layer content.

What is the motivation for that requirement ("just by looking at the layer content")? Why can't we use a SBOM for the vulnerability check? To be really safe you have to scan the code anyway, because of embedded packages (vendoring) or git submodules.
On 09.01.2025 at 12:53, Martin Jansa wrote:
> Hi,
>
> thanks for looking into this.
>
> With this series applied I've noticed some recipes now showing warnings like:
> WARNING: enact-dev-native-6.1.3-r0 do_unpack: Please add support for the url to npm fetcher: https://registry.npmjs.org/string-width/-/string-width-4.2.3.tgz
> WARNING: enact-dev-native-6.1.3-r0 do_unpack: Please add support for the url to npm fetcher: https://registry.npmjs.org/strip-ansi/-/strip-ansi-6.0.1.tgz
> WARNING: enact-dev-native-6.1.3-r0 do_unpack: Please add support for the url to npm fetcher: https://registry.npmjs.org/wrap-ansi/-/wrap-ansi-7.0.0.tgz

Thanks for your test. I assume the packages are renamed. I have a fix for that in my WIP branch.

https://github.com/weidmueller/poky/commit/ae988d20777d7a542fe18fcbf95110829eef0b4f

> The same warnings are shown earlier, from do_fetch.
>
> Not sure what's special about these, but I believe it used to work with the previous npmsw implementation. Any hint what to check?

Could you please check whether the entry in the package-lock.json contains a "name" field? It looks like this is an undocumented feature of the package-lock.json.

> On Fri, Dec 20, 2024 at 12:26 PM Stefan Herbrechtsmeier via lists.openembedded.org <stefan.herbrechtsmeier-oss=weidmueller.com@lists.openembedded.org> wrote:
>> [...]
You receive all messages sent to this group. >> View/Reply Online (#16920): https://lists.openembedded.org/g/bitbake-devel/message/16920 >> Mute This Topic: https://lists.openembedded.org/mt/110212697/3617156 >> Group Owner: bitbake-devel+owner@lists.openembedded.org >> Unsubscribe: https://lists.openembedded.org/g/bitbake-devel/unsub [martin.jansa@gmail.com] >> -=-=-=-=-=-=-=-=-=-=-=- >>
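[The per-ecosystem steps described in the series cover letter above — extract dependencies from a specification file, then map them to common download URLs — can be sketched in plain Python. This is an illustration only, not the bitbake fetcher API: the URL template and function name are made up, and only the npm v2/v3 "packages" lockfile layout is assumed.]

```python
import json

# Illustrative npm registry URL template for step 3; the real fetcher
# additionally appends downloadfilename= and checksum parameters.
NPM_URL = "https://registry.npmjs.org/{name}/-/{base}-{version}.tgz"

def npm_lockfile_to_urls(lockfile_path):
    """Steps 1 and 3 for npm: extract (name, version, url, integrity)
    tuples from a package-lock.json / npm-shrinkwrap.json in the v2/v3
    format and map them to registry download URLs."""
    with open(lockfile_path) as f:
        lock = json.load(f)
    urls = []
    # v2/v3 lockfiles list dependencies under "packages", keyed by their
    # install path such as "node_modules/glob"; the root package uses
    # the empty key and is skipped here.
    for path, meta in lock.get("packages", {}).items():
        if not path:
            continue
        name = path.split("node_modules/")[-1]
        base = name.split("/")[-1]  # tarball name drops any @scope/ prefix
        urls.append((name, meta["version"],
                     NPM_URL.format(name=name, base=base,
                                    version=meta["version"]),
                     meta.get("integrity")))
    return urls
```

For the glob example from the cover letter, this yields the registry tarball URL https://registry.npmjs.org/glob/-/glob-10.3.15.tgz.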
On Thu, 9 Jan 2025 at 15:00, Stefan Herbrechtsmeier
<stefan.herbrechtsmeier-oss@weidmueller.com> wrote:
> I missed the appropriate function in the dependency mixin in this
> series. The list is created on demand (see archiver or spdx patch).
> Every derivation needs to be handled in a package manager lock file.
> Therefore you could place a lock file beside the recipe. You could use
> an editor or the language specific tools to manipulate the lock file.

I'm not sure, is there code I can try that does this (and if so,
how?), or is this code still to be written?

> What is the motivation for that requirement ("just by looking at the layer content")?
> Why can't we use a SBOM for the vulnerability check? To be really safe you have
> to scan the code because of embedded packages (vendoring) or git submodules.

That's right. I don't disagree with this.

I do however have another concern I want to express: I can't convince
myself that the 'integrated fetcher' is an overall significant,
obvious, major improvement over the 'generate the SRC_URI lists in
.inc files via task in a bbclass' approach.

Just a couple of reasons:

- the 'integrated fetcher' is not trivial, and notably increases the
complexity of the bitbake fetcher codebase. We already struggle to
maintain bitbake, RP is overloaded, and very few other people have
time and knowledge to look at bitbake patches and understand what is
going on. You've already seen this with your patchset, where getting it
properly reviewed by anyone other than RP is an ongoing challenge. On
the other hand, the .inc updaters are fully contained in oe-core
classes, they implement a task in the well-understood 'recipe python'
dialect and thus benefit from a lot more people being able to take
care of them. They're also safer in the sense that any bugs in them
are only triggered when someone needs to update a recipe. Fetchers, on
the other hand, are fairly critical pieces of code and they must work
regardless of host environment, python versions, unforeseen corner
cases in source trees and so on.

- we might be able to remove those long SRC_URI lists by migrating
recipes to the integrated fetcher, but we won't be able to do this
with the licensing information (pointers+checksums to licenses,
license strings) for items that are being fetched. For that, you still
need some way to write it into a recipe with a tool. We don't do this
yet, but we really should.

Alex
On 20.12.2024 at 12:25, Stefan Herbrechtsmeier via lists.openembedded.org wrote:
> From: Stefan Herbrechtsmeier <stefan.herbrechtsmeier@weidmueller.com>
>
> Downloads from package manager repositories are identified via registry,
> name, and version. The fetchers use individual styles to define the
> download metadata:
>
> npm://<REGISTRY>;package=<NAME>;version=<VERSION>
>
> crate://<REGISTRY>/<NAME>/<VERSION>
>
> GO_MOD_PROXY = "<REGISTRY>"
> gomod://<NAME>;version=<VERSION>
> gomodgit://<NAME>;version=<VERSION>;repo=<REPOSITORY>
>
> The name and version are important for the SBOM to add usable name,
> version, and CPE to the SBOM entries for the downloaded dependencies.
> Introduce a common style and check the existence of the parameters:
>
> <TYPE>://<REGISTRY | REPOSITORY>;dn=<NAME>;dv=<VERSION>
>
> The style clearly separates the metadata and supports slashes and @
> in the name.
>
> Signed-off-by: Stefan Herbrechtsmeier <stefan.herbrechtsmeier@weidmueller.com>
> ---
>
>  lib/bb/fetch2/__init__.py | 12 ++++++++++++
>  1 file changed, 12 insertions(+)
>
> diff --git a/lib/bb/fetch2/__init__.py b/lib/bb/fetch2/__init__.py
> index d2a30c18f..4b7c01d6a 100644
> --- a/lib/bb/fetch2/__init__.py
> +++ b/lib/bb/fetch2/__init__.py
> @@ -1356,6 +1356,12 @@ class FetchData(object):
>          if hasattr(self.method, "urldata_init"):
>              self.method.urldata_init(self, d)
>
> +        if self.method.require_download_metadata():
> +            if "dn" not in self.parm:
> +                raise MissingParameterError("dn", self.url)
> +            if "dv" not in self.parm:
> +                raise MissingParameterError("dv", self.url)
> +

As an alternative to the short names (dn, dv), we could add an optional
version to the resolution of the checksum and source revision and remove
the version value from the name parameter:

configure_checksum:

    if all(key in self.parm for key in ["name", "version"]):
        checksum_name = "%s@%s.%ssum" % (self.parm["name"], self.parm["version"], checksum_id)

srcrev_internal_helper:

    if name and version:
        attempts.append("SRCREV_%s@%s" % (name, version))

>          for checksum_id in CHECKSUM_LIST:
>              configure_checksum(checksum_id)
>
> @@ -1711,6 +1717,12 @@ class FetchMethod(object):
>          """
>          return []
>
> +    def require_download_metadata(self):
> +        """
> +        The fetcher requires download name (dn) and version (dv) parameters.
> +        """
> +        return False
> +
>
>  class DummyUnpackTracer(object):
>      """
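[The proposed common `;dn=<NAME>;dv=<VERSION>` style can be illustrated with a standalone sketch. The exception class and the parameter splitting below are simplified stand-ins for bitbake's internals (bb.fetch2.MissingParameterError and FetchData), not the actual fetch2 code:]

```python
class MissingParameterError(Exception):
    """Simplified stand-in for bb.fetch2.MissingParameterError."""
    def __init__(self, param, url):
        super().__init__("Missing parameter '%s' in url '%s'" % (param, url))
        self.param = param
        self.url = url

def parse_parm(url):
    """Split the ;key=value parameters off a bitbake-style SRC_URI."""
    base, *params = url.split(";")
    parm = {}
    for p in params:
        key, _, value = p.partition("=")
        parm[key] = value
    return base, parm

def check_download_metadata(url):
    """Enforce the proposed <TYPE>://<REGISTRY>;dn=<NAME>;dv=<VERSION>
    style: both dn and dv must be present."""
    base, parm = parse_parm(url)
    for key in ("dn", "dv"):
        if key not in parm:
            raise MissingParameterError(key, url)
    return parm["dn"], parm["dv"]
```

Because dn is an ordinary parameter value rather than part of the path, names containing slashes (golang.org/x/net) or an @ scope pass through unchanged, which is the point made in the commit message.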
On 09.01.2025 at 20:40, Alexander Kanavin wrote:
> On Thu, 9 Jan 2025 at 15:00, Stefan Herbrechtsmeier
> <stefan.herbrechtsmeier-oss@weidmueller.com> wrote:
>> I missed the appropriate function in the dependency mixin in this
>> series. The list is created on demand (see archiver or spdx patch).
>> Every derivation needs to be handled in a package manager lock file.
>> Therefore you could place a lock file beside the recipe. You could use
>> an editor or the language specific tools to manipulate the lock file.
> I'm not sure, is there code I can try that does this (and if so,
> how?), or is this code still to be written?

You have to inherit create-spdx-2.2. If you use poky as distro you have
to replace the 3.0 in create-spdx with 2.2 because it is impossible to
override the inherit in poky.conf. Afterwards you can create an SBOM
with the following command:

bitbake -c create_spdx librsvg

The same feature will be added to create-spdx-3.0, but for that I need
some recommendations from the spdx experts. After we have an agreement
on how to provide the needed information I will work on the
create-spdx-3.0 support.

>> What is the motivation for that requirement ("just by looking at the layer content")?
>> Why can't we use a SBOM for the vulnerability check? To be really safe you have
>> to scan the code because of embedded packages (vendoring) or git submodules.
> That's right. I don't disagree with this.
>
> I do however have another concern I want to express: I can't convince
> myself that the 'integrated fetcher' is an overall significant,
> obvious, major improvement over the 'generate the SRC_URI lists in
> .inc files via task in a bbclass' approach.
>
> Just a couple of reasons:
>
> - the 'integrated fetcher' is not trivial, and notably increases the
> complexity of the bitbake fetcher codebase. We already struggle to
> maintain bitbake, RP is overloaded, and very few other people have
> time and knowledge to look at bitbake patches and understand what is
> going on. You've already seen this with your patchset, where getting it
> properly reviewed by anyone other than RP is an ongoing challenge. On
> the other hand, the .inc updaters are fully contained in oe-core
> classes, they implement a task in the well-understood 'recipe python'
> dialect and thus benefit from a lot more people being able to take
> care of them. They're also safer in the sense that any bugs in them
> are only triggered when someone needs to update a recipe. Fetchers, on
> the other hand, are fairly critical pieces of code and they must work
> regardless of host environment, python versions, unforeseen corner
> cases in source trees and so on.

You mixed two different points. We have to distinguish between the
bitbake fetcher and the on-the-fly resolution of SRC_URIs.

Regarding the bitbake fetcher, the same reasons are true for the
language specific fetchers. The fetchers are based on the wget or git
fetcher. They only add a preprocessing of the source URI and a
post-processing of the download. There is no requirement to do this
inside the fetcher.

The on-the-fly resolution is also possible in oe-core. I think it isn't
really practicable to manipulate the resolved source URIs because of
dependencies between dependencies and the relationship to other package
manager configuration files. Why shouldn't we use the package manager
specific tools to update the configuration and dependency specification?
I understand that a patch is more straightforward than a new dependency
specification file. The inc file is an OE-specific format of a
dependency specification / lock file without available tools to update
entries with respect to the relationship between entries. Furthermore it
is impossible to use the changes outside of oe for tests or debugging.

What is your opinion regarding gitsm? Should we remove the bitbake
fetcher and use an update task to generate an inc file with the source
URIs and source revisions? Do you really review the changes of the inc
file?

I understand the points but I have the feeling that they are more
theoretical for package manager dependencies or could be solved in
another way (ex. caching).

> - we might be able to remove those long SRC_URI lists by migrating
> recipes to the integrated fetcher, but we won't be able to do this
> with the licensing information (pointers+checksums to licenses,
> license strings) for items that are being fetched. For that, you still
> need some way to write it into a recipe with a tool. We don't do this
> yet, but we really should.

The license topic is independent of the fetcher because the dependency
specification doesn't contain license information. The recipetool shows
that the license topic is really complicated. It is possible to fetch a
license string from the package manager repository but this information
is useless without a pointer to the license file and a checksum. The
automatic determination of the license from a file needs very good
tooling because we need to trust the process and it must minimize the
manual corrections. Furthermore you need a central database, otherwise
you have to fix the same problem twice because the recipes could use the
same dependency.

Even if we put all license information inside the inc file: who should
review the changes? What tooling is used to review the change (license
content)? If we blindly trust the inc file generator, the inc file is
useless and we can generate the information on-the-fly.

I understand the motivation for the update task / inc file but I don't
think it adds any practical benefit. Nevertheless I will move my
implementation to oe-core and add a task to generate an inc file as a
starting point.
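[The "update task generating an .inc file" approach being debated here could, very roughly, boil down to a helper like the one below. This is a sketch only: the function name and the SRC_URI name-flag scheme are made up for illustration and are not oe-core's actual convention.]

```python
def write_src_uri_inc(deps, inc_path):
    """Write resolved dependencies as an includable .inc fragment with
    one SRC_URI entry plus one checksum line per dependency.
    `deps` is a list of (name, version, url, sha256) tuples."""
    lines = ["# Autogenerated from the lock file - do not edit manually.\n"]
    for name, version, url, sha256 in deps:
        # Derive a per-entry name flag; slashes are not valid in flag
        # names, so fold them into dashes (illustrative scheme).
        flag = "%s-%s" % (name.replace("/", "-"), version)
        lines.append('SRC_URI += "%s;name=%s"\n' % (url, flag))
        lines.append('SRC_URI[%s.sha256sum] = "%s"\n' % (flag, sha256))
    with open(inc_path, "w") as f:
        f.writelines(lines)
```

The resulting fragment is what would show up as the "auto-generated noise" in recipe-update diffs mentioned later in the thread: one URL line and one checksum line per dependency.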
On Fri, 10 Jan 2025 at 12:32, Stefan Herbrechtsmeier
<stefan.herbrechtsmeier-oss@weidmueller.com> wrote:
> What is your opinion regarding gitsm? Should we remove the bitbake fetcher and use an update task to generate an inc file with the source URIs and source revisions?

That ship has sailed. We can't remove gitsm, it has users, and they
will be very angry.

> Do you really review the changes of the inc file?
> I understand the points but I have the feeling that they are more theoretical for package manager dependencies or could be solved in another way (ex. caching)

But do you? I have to restate the point: a solution that can be placed
inside a layer is much more scalable and maintainable than adding code
to bitbake. That's why I'm leaning towards drawing the line at
existing fetchers that are wget/git convenience wrappers, and shifting
dependency/lockfile management to layers. It's ultimately RP's call,
but he does seek feedback :)

I'm fine with large SRC_URI/sha256 diffs when recipes get updated to
new versions. And since you asked, no, no one looks at them, they're
auto-generated noise that we learned to block out, just as we learned
to quickly skim over recipe patch changes that are just line number
churn and similar non-functional changes.

> Even if we put all license information inside the inc file: who should review the changes? What tooling is used to review the change (license content)? If we blindly trust the inc file generator, the inc file is useless and we can generate the information on-the-fly.

We won't blindly trust a generator. There are multiple gate-keeping
steps, some of which already work, and some should still be
implemented:

- when creating a recipe with devtool, devtool should discover all
licenses and generate appropriate recipe metadata. For classic unix-y
components this has to rely on 'guessing', but things like crates have
deterministic licensing metadata (a field in Cargo.toml, and LICENSE-*
files if I remember right). We can also propose adding such
determinism upstream if it's not currently good enough.

- when updating a recipe with devtool to a new upstream release, it
uses the file:// entries in LIC_FILES_CHKSUM to generate a diff of
previous license texts and the new ones, and writes that as a comment
into the updated recipe. The diff is reviewed by a human performing
the update, and condensed into an update to the LICENSE field (if
needed), and an explanation of what changed in the License-Update tag
in the commit message. This could be further automated if upstream has
deterministic ways to specify licenses, e.g. LICENSE =
"&".join(all_license_ids).

- when sending the resulting patch for review, there's a mailing list
bot (patchtest), which will check that any update in license checksums
is accompanied by an explanation in the License-Update tag. There are
also humans which will check that the licensing changes are sensible.
Otherwise we do trust that submitters spot important changes in
licensing (from the diff in the previous step or by manual comparison,
if they want) and summarise them in LICENSE correctly.

- finally there are various license checks that run in the recipe_qa
task and are implemented in insane.bbclass. They could be extended to
verify that every dependency has a matching license entry in the recipe
and so on. Anything that can be caught by looking at the source tree
and the license metadata.

> Nevertheless I will move my implementation to oe-core and add a task to generate an inc file as a starting point.

That would be much appreciated. The more I think about it the more I'm
convinced we should have it standardized in core.

Alex
On 10.01.2025 at 14:26, Alexander Kanavin wrote:
> On Fri, 10 Jan 2025 at 12:32, Stefan Herbrechtsmeier
> <stefan.herbrechtsmeier-oss@weidmueller.com> wrote:
>> What is your opinion regarding gitsm? Should we remove the bitbake fetcher and use an update task to generate an inc file with the source URIs and source revisions?
> That ship has sailed. We can't remove gitsm, it has users, and they
> will be very angry.

This makes it impossible to fix wrong design decisions or remove code
with low code quality.

>> Do you really review the changes of the inc file?
>> I understand the points but I have the feeling that they are more theoretical for package manager dependencies or could be solved in another way (ex. caching)
> But do you? I have to restate the point: a solution that can be placed
> inside a layer is much more scalable and maintainable than adding code
> to bitbake. That's why I'm leaning towards drawing the line at
> existing fetchers that are wget/git convenience wrappers, and shifting
> dependency/lockfile management to layers. It's ultimately RP's call,
> but he does seek feedback :)

I'm working on it.

> I'm fine with large SRC_URI/sha256 diffs when recipes get updated to
> new versions. And since you asked, no, no one looks at them, they're
> auto-generated noise that we learned to block out, just as we learned
> to quickly skim over recipe patch changes that are just line number
> churn and similar non-functional changes.

Instead of an inc file the generated SRC_URIs could be saved inside the
work directory of the recipe. This will eliminate the noise and avoid a
manual run of an update task after a recipe changes.

>> Even if we put all license information inside the inc file: who should review the changes? What tooling is used to review the change (license content)? If we blindly trust the inc file generator, the inc file is useless and we can generate the information on-the-fly.
> We won't blindly trust a generator. There are multiple gate-keeping
> steps, some of which already work, and some should still be
> implemented:
>
> - when creating a recipe with devtool, devtool should discover all
> licenses and generate appropriate recipe metadata. For classic unix-y
> components this has to rely on 'guessing', but things like crates have
> deterministic licensing metadata (a field in Cargo.toml, and LICENSE-*
> files if I remember right). We can also propose adding such
> determinism upstream if it's not currently good enough.
>
> - when updating a recipe with devtool to a new upstream release, it
> uses the file:// entries in LIC_FILES_CHKSUM to generate a diff of
> previous license texts and the new ones, and writes that as a comment
> into the updated recipe. The diff is reviewed by a human performing
> the update, and condensed into an update to the LICENSE field (if
> needed), and an explanation of what changed in the License-Update tag
> in the commit message. This could be further automated if upstream has
> deterministic ways to specify licenses, e.g. LICENSE =
> "&".join(all_license_ids).
>
> - when sending the resulting patch for review, there's a mailing list
> bot (patchtest), which will check that any update in license checksums
> is accompanied by an explanation in the License-Update tag. There are
> also humans which will check that the licensing changes are sensible.
> Otherwise we do trust that submitters spot important changes in
> licensing (from the diff in the previous step or by manual comparison,
> if they want) and summarise them in LICENSE correctly.
>
> - finally there are various license checks that run in the recipe_qa
> task and are implemented in insane.bbclass. They could be extended to
> verify that every dependency has a matching license entry in the recipe
> and so on. Anything that can be caught by looking at the source tree
> and the license metadata.

This works for individual projects but becomes complicated for
dependencies because you have to handle the same change multiple times.
But let's stop the discussion for now because license is out of scope of
this series.

>> Nevertheless I will move my implementation to oe-core and add a task to generate an inc file as a starting point.
> That would be much appreciated. The more I think about it the more I'm
> convinced we should have it standardized in core.

What do you mean by standardized?
On Fri, 10 Jan 2025 at 16:04, Stefan Herbrechtsmeier
<stefan.herbrechtsmeier-oss@weidmueller.com> wrote:
> What is your opinion regarding gitsm? Should we remove the bitbake fetcher and use an update task to generate an inc file with the source URIs and source revisions?
>
> That ship has sailed. We can't remove gitsm, it has users, and they
> will be very angry.
>
> This makes it impossible to fix wrong design decisions or remove code with low code quality.

It's still possible, you just can't be heavy-handed and dictatorial
about 'removing' stuff you don't like. When the existing thing works
very well for a lot of people (and gitsm does), then the new thing has
to be obviously better, you need to do your best to convince as many
people as possible of that, and it needs to co-exist with the old thing,
so that users can migrate at their own pace. And some of the users may
never do that, and they will get annoyed at or ignore deprecation
warnings or similar attempts to push them.

> Instead of an inc file the generated SRC_URIs could be saved inside the work directory of the recipe. This will eliminate the noise and avoid a manual run of an update task after a recipe changes.

I would be very interested to see the proof of concept that does this.

> Nevertheless I will move my implementation to oe-core and add a task to generate an inc file as a starting point.
>
> That would be much appreciated. The more I think about it the more I'm
> convinced we should have it standardized in core.
>
> What do you mean by standardized?

Standardized handling of embedded dependencies, lock files and various
other aspects of language-specific package managers, so that adding
support for a new thing would be writing a new
extension/plugin/subclass for the existing framework.

Alex
On Fri, Jan 10, 2025 at 10:04 AM Stefan Herbrechtsmeier via
lists.openembedded.org
<stefan.herbrechtsmeier-oss=weidmueller.com@lists.openembedded.org> wrote:
> Instead of an inc file the generated SRC_URIs could be saved inside the
> work directory of the recipe. This will eliminate the noise and avoid a
> manual run of an update task after a recipe changes.

Except for those that want the .inc file changes to be version
controlled (as well as SRC_URI changes), but maybe I'm misunderstanding
what you described above.

A generated temporary/build file is definitely more visible than
something that is programmatically done and held internally during
recipe processing and build. It opens the door for extension and doing
version control on it. So I don't object to the concept, I just don't
think I have all the details straight in my head.

Cheers,

Bruce
Am 10.01.2025 um 21:24 schrieb Bruce Ashfield: > On Fri, Jan 10, 2025 at 10:04 AM Stefan Herbrechtsmeier via > lists.openembedded.org <http://lists.openembedded.org> > <stefan.herbrechtsmeier-oss=weidmueller.com@lists.openembedded.org> wrote: > > Am 10.01.2025 um 14:26 schrieb Alexander Kanavin: >> On Fri, 10 Jan 2025 at 12:32, Stefan Herbrechtsmeier >> <stefan.herbrechtsmeier-oss@weidmueller.com> <mailto:stefan.herbrechtsmeier-oss@weidmueller.com> wrote: >>> What is your opinion regarding gitsm. Should we remove the bitbake fetcher and use a update task to generate a inc file with the source uris and source revisions? >> That ship has sailed. We can't remove gitsm, it has users, and they >> will be very angry. > > This makes it impossible to fix wrong design decision or remove > code with a low code quality. > >>> Do you really review the changes of the inc file? >>> I understand the points but I have the feeling that they are more theoretically for package manager dependencies or could be solved in an other way (ex. caching) >> But do you? I have to restate the point: a solution that can be placed >> inside a layer is much more scalable and maintainable than adding code >> to bitbake. That's why I'm leaning towards drawing the line at >> existing fetchers that are wget/git convenience wrappers, and shifting >> dependency/lockfile management to layers. It's ultimately RP's call, >> but he does seek feedback :) > I'm working on it. > >> I'm fine with large SRC_URI/sha256 diffs when recipes get updated to >> new versions. And since you asked, no, no one looks at them, they're >> auto-generated noise that we learned to block out, just as we learned >> to quickly skim over recipe patch changes that are just line number >> churn and similar non-functional changes. > > Instead of an inc file the generated SRC_URIs could be saved > inside the work directory of the recipe. This will eliminate the > noise and avoid a manual run of an update task after a recipe changes. 
> > > Except for those that want the .inc file changes to be version > controlled (as well as SRC_URI changes), but maybe I'm > misunderstanding what you described above Why should somebody version control the generated SRC_URI? > A generated temporary/build file is definitely more visible than > something that is programmatically done and held internally during > recipe processing and build. It opens the door for extension and > doing version control on it. So I don't object to the concept, I just > don't think I have all the details straight in my head. A generated build file will be saved in the work directory of the recipe like any other generated build file. It is impossible to add it to the version control system. The update task create a version controlled generated source file. I don't understand why the version control is needed because the source of the generator and the generator are version controlled. Especially if the output is ignored during patch review. I think it is much more straightforward to patch the source (lock file) because it is complicated to handle manual changes during regeneration of a generated file. >>> Even if we put all license information inside the inc file. Who should review the changes ? What tooling is used to review the change (license content)? If we blindly trust the inc file generator, the inc file is useless and we can generate the information on-the-fly. >> We won't blindly trust a generator. There are multiple gate-keeping >> steps, some of which already work, and some should still be >> implemented: >> >> - when creating a recipe with devtool, devtool should discover all >> licenses and generate appropriate recipe metadata. For classic unix-y >> components this has to rely on 'guessing', but things like crates have >> deterministic licensing metadata (a field in Cargo.toml, and LICENSE-* >> files if I remember right). We can also propose adding such >> determinism upstream if it's not currently good enough. 
>> >> - when updating a recipe with devtool to a new upstream release, it >> uses thefile:// entries in LIC_FILES_CHKSUM to generate a diff of >> previous license texts and the new ones, and writes that as a comment >> into the updated recipe. The diff is reviewed by a human performing >> the update, and condensed into an update to the LICENSE field (if >> needed), and an explanation of what changed in the License-Update tag >> in the commit message. This could be further automated if upstream has >> deterministic ways to specify licenses, e.g. LICENSE = >> "&".join(all_license_ids). >> >> - when sending the resulting patch for review, there's a mailing list >> bot (patchtest), which will check that any update in license checksums >> is accompanied by an explanation in License-Update tag. There are also >> humans which will check that the licensing changes are sensible. >> Otherwise we do trust that submitters spot important changes in >> licensing (from the diff in the previous step or by manual comparison, >> if they want) and summarise them in LICENSE correctly. >> >> - finally there are various license checks that run in recipe_qa task >> and implemented in insane.bbclass. They could be extended to verify >> that every dependency has a matching license entry in the recipe and >> so on. Anything that can be caught by looking at the source tree and >> the license metadata. > This works for individual project but become complicated for > dependencies because you have to handle the same change multiple > times. But lets stop the discussion for now because license is out > of scope of this series. > >>> Nevertheless I will move my implementation to oe-core and add a task to generate an inc file as starting point. >> That would be much appreciated. The more I think about it the more I'm >> convinced we should have it standardized in core. > What do you mean by standardized? > > > -=-=-=-=-=-=-=-=-=-=-=- > Links: You receive all messages sent to this group. 
> View/Reply Online (#17006): > https://lists.openembedded.org/g/bitbake-devel/message/17006 > Mute This Topic: https://lists.openembedded.org/mt/110212697/1050810 > Group Owner: bitbake-devel+owner@lists.openembedded.org > <mailto:bitbake-devel%2Bowner@lists.openembedded.org> > Unsubscribe: https://lists.openembedded.org/g/bitbake-devel/unsub > [bruce.ashfield@gmail.com] > -=-=-=-=-=-=-=-=-=-=-=- > > > > -- > - Thou shalt not follow the NULL pointer, for chaos and madness await > thee at its end > - "Use the force Harry" - Gandalf, Star Trek II >
On Mon, Jan 13, 2025 at 2:11 AM Stefan Herbrechtsmeier < stefan.herbrechtsmeier-oss@weidmueller.com> wrote: > Am 10.01.2025 um 21:24 schrieb Bruce Ashfield: > > On Fri, Jan 10, 2025 at 10:04 AM Stefan Herbrechtsmeier via > lists.openembedded.org <stefan.herbrechtsmeier-oss= > weidmueller.com@lists.openembedded.org> wrote: > >> Am 10.01.2025 um 14:26 schrieb Alexander Kanavin: >> >> On Fri, 10 Jan 2025 at 12:32, Stefan Herbrechtsmeier<stefan.herbrechtsmeier-oss@weidmueller.com> <stefan.herbrechtsmeier-oss@weidmueller.com> wrote: >> >> What is your opinion regarding gitsm. Should we remove the bitbake fetcher and use a update task to generate a inc file with the source uris and source revisions? >> >> That ship has sailed. We can't remove gitsm, it has users, and they >> will be very angry. >> >> This makes it impossible to fix wrong design decision or remove code with >> a low code quality. >> >> Do you really review the changes of the inc file? >> I understand the points but I have the feeling that they are more theoretically for package manager dependencies or could be solved in an other way (ex. caching) >> >> But do you? I have to restate the point: a solution that can be placed >> inside a layer is much more scalable and maintainable than adding code >> to bitbake. That's why I'm leaning towards drawing the line at >> existing fetchers that are wget/git convenience wrappers, and shifting >> dependency/lockfile management to layers. It's ultimately RP's call, >> but he does seek feedback :) >> >> I'm working on it. >> >> I'm fine with large SRC_URI/sha256 diffs when recipes get updated to >> new versions. And since you asked, no, no one looks at them, they're >> auto-generated noise that we learned to block out, just as we learned >> to quickly skim over recipe patch changes that are just line number >> churn and similar non-functional changes. >> >> Instead of an inc file the generated SRC_URIs could be saved inside the >> work directory of the recipe. 
This will eliminate the noise and avoid a >> manual run of an update task after a recipe changes. >> > > Except for those that want the .inc file changes to be version controlled > (as well as SRC_URI changes), but maybe I'm misunderstanding what you > described above > > Why should somebody version control the generated SRC_URI? > > > Why wouldn't they ? I'm talking about when the SRC_URI is generated to git fetches (or whatever), that is part of the recipe and version controlled. My point is that this is not throw away / transient information for many use cases. It is something that can be tracked between updates to the recipes. > A generated temporary/build file is definitely more visible than something > that is programmatically done and held internally during recipe processing > and build. It opens the door for extension and doing version control on > it. So I don't object to the concept, I just don't think I have all the > details straight in my head. > > A generated build file will be saved in the work directory of the recipe > like any other generated build file. It is impossible to add it to the > version control system. The update task create a version controlled > generated source file. I don't understand why the version control is needed > because the source of the generator and the generator are version > controlled. Especially if the output is ignored during patch review. I > think it is much more straightforward to patch the source (lock file) > because it is complicated to handle manual changes during regeneration of a > generated file. > *sigh*. I'm quite aware of what can and cannot be done. That's not what I meant. I'm obviously not talking about something in WORKDIR. I'm just saying that if something is written to disk, then depending on how things are implemented it can be viewed, debugged and manipulated. If it is always generated, held internally to the classes and used, I have no options to do that sort of debug. 
Similarly, anything that is generated, it would be ideal if there was a way to re-use a previously generated artifact and not generate it on the fly .. that's the element that opens the door to version control and tracking. We'll agree to disagree on what is or isn't efficient or complicated. Luckily, this is all opt-in, so I'll never really have to use it. I'm just sharing what it would take to get me to consider it based on what I've learned/suffered in my time maintaining quite a few go recipes. Cheers, Bruce > Even if we put all license information inside the inc file. Who should review the changes ? What tooling is used to review the change (license content)? If we blindly trust the inc file generator, the inc file is useless and we can generate the information on-the-fly. >> >> We won't blindly trust a generator. There are multiple gate-keeping >> steps, some of which already work, and some should still be >> implemented: >> >> - when creating a recipe with devtool, devtool should discover all >> licenses and generate appropriate recipe metadata. For classic unix-y >> components this has to rely on 'guessing', but things like crates have >> deterministic licensing metadata (a field in Cargo.toml, and LICENSE-* >> files if I remember right). We can also propose adding such >> determinism upstream if it's not currently good enough. >> >> - when updating a recipe with devtool to a new upstream release, it >> uses the file:// entries in LIC_FILES_CHKSUM to generate a diff of >> previous license texts and the new ones, and writes that as a comment >> into the updated recipe. The diff is reviewed by a human performing >> the update, and condensed into an update to the LICENSE field (if >> needed), and an explanation of what changed in the License-Update tag >> in the commit message. This could be further automated if upstream has >> deterministic ways to specify licenses, e.g. LICENSE = >> "&".join(all_license_ids). 
>> >> - when sending the resulting patch for review, there's a mailing list >> bot (patchtest), which will check that any update in license checksums >> is accompanied by an explanation in License-Update tag. There are also >> humans which will check that the licensing changes are sensible. >> Otherwise we do trust that submitters spot important changes in >> licensing (from the diff in the previous step or by manual comparison, >> if they want) and summarise them in LICENSE correctly. >> >> - finally there are various license checks that run in recipe_qa task >> and implemented in insane.bbclass. They could be extended to verify >> that every dependency has a matching license entry in the recipe and >> so on. Anything that can be caught by looking at the source tree and >> the license metadata. >> >> This works for individual project but become complicated for dependencies >> because you have to handle the same change multiple times. But lets stop >> the discussion for now because license is out of scope of this series. >> >> Nevertheless I will move my implementation to oe-core and add a task to generate an inc file as starting point. >> >> That would be much appreciated. The more I think about it the more I'm >> convinced we should have it standardized in core. >> >> What do you mean by standardized?
On Fri, 17 Jan 2025 at 05:20, Bruce Ashfield <bruce.ashfield@gmail.com> wrote: > *sigh*. I'm quite aware of what can and cannot be done. That's not what I meant. I'm obviously not talking about something in WORKDIR. I'm just saying that if something is written to disk, then depending on how things are implemented it can be viewed, debugged and manipulated. If it is always generated, held internally to the classes and used, I have no options to do that sort of debug. Similarly, anything that is generated, it would be ideal if there was a way to re-use a previously generated artifact and not generate it on the fly .. that's the element that opens the door to version control and tracking. > > We'll agree to disagree on what is or isn't efficient or complicated. Luckily, this is all opt-in, so I'll never really have to use it. I'm just sharing what it would take to get me to consider it based on what I've learned/suffered in my time maintaining quite a few go recipes. I beg to differ, as someone who maintains a few rust/cargo recipes. I haven't once found this ability to track SRC_URIs in recipes useful. It's always been auto-generated noise and I'd be very willing to consider an implementation that keeps it neatly hidden, if this implementation is fully oe-core based. So Stefan, don't let this discourage you. Alex
Am 17.01.2025 um 05:19 schrieb Bruce Ashfield via lists.openembedded.org: > On Mon, Jan 13, 2025 at 2:11 AM Stefan Herbrechtsmeier > <stefan.herbrechtsmeier-oss@weidmueller.com> wrote: > > Am 10.01.2025 um 21:24 schrieb Bruce Ashfield: >> On Fri, Jan 10, 2025 at 10:04 AM Stefan Herbrechtsmeier via >> lists.openembedded.org <http://lists.openembedded.org> >> <stefan.herbrechtsmeier-oss=weidmueller.com@lists.openembedded.org> >> wrote: >> >> Am 10.01.2025 um 14:26 schrieb Alexander Kanavin: >>> On Fri, 10 Jan 2025 at 12:32, Stefan Herbrechtsmeier >>> <stefan.herbrechtsmeier-oss@weidmueller.com> <mailto:stefan.herbrechtsmeier-oss@weidmueller.com> wrote: >>>> What is your opinion regarding gitsm. Should we remove the bitbake fetcher and use a update task to generate a inc file with the source uris and source revisions? >>> That ship has sailed. We can't remove gitsm, it has users, and they >>> will be very angry. >> >> This makes it impossible to fix wrong design decision or >> remove code with a low code quality. >> >>>> Do you really review the changes of the inc file? >>>> I understand the points but I have the feeling that they are more theoretically for package manager dependencies or could be solved in an other way (ex. caching) >>> But do you? I have to restate the point: a solution that can be placed >>> inside a layer is much more scalable and maintainable than adding code >>> to bitbake. That's why I'm leaning towards drawing the line at >>> existing fetchers that are wget/git convenience wrappers, and shifting >>> dependency/lockfile management to layers. It's ultimately RP's call, >>> but he does seek feedback :) >> I'm working on it. >> >>> I'm fine with large SRC_URI/sha256 diffs when recipes get updated to >>> new versions. 
And since you asked, no, no one looks at them, they're >>> auto-generated noise that we learned to block out, just as we learned >>> to quickly skim over recipe patch changes that are just line number >>> churn and similar non-functional changes. >> >> Instead of an inc file the generated SRC_URIs could be saved >> inside the work directory of the recipe. This will eliminate >> the noise and avoid a manual run of an update task after a >> recipe changes. >> >> >> Except for those that want the .inc file changes to be version >> controlled (as well as SRC_URI changes), but maybe I'm >> misunderstanding what you described above > > Why should somebody version control the generated SRC_URI? > > > Why wouldn't they ? I'm talking about when the SRC_URI is generated to > git fetches (or whatever), that is part of the recipe and version > controlled. > > My point is that this is not throw away / transient information for > many use cases. It is something that can be tracked between updates to > the recipes. > >> A generated temporary/build file is definitely more visible than >> something that is programmatically done and held internally >> during recipe processing and build. It opens the door for >> extension and doing version control on it. So I don't object to >> the concept, I just don't think I have all the details straight >> in my head. > > A generated build file will be saved in the work directory of the > recipe like any other generated build file. It is impossible to > add it to the version control system. The update task create a > version controlled generated source file. I don't understand why > the version control is needed because the source of the generator > and the generator are version controlled. Especially if the output > is ignored during patch review. I think it is much more > straightforward to patch the source (lock file) because it is > complicated to handle manual changes during regeneration of a > generated file. > > *sigh*. 
> I'm quite aware of what can and cannot be done. That's not
> what I meant. I'm obviously not talking about something in WORKDIR.
> I'm just saying that if something is written to disk, then depending
> on how things are implemented it can be viewed, debugged and
> manipulated. If it is always generated, held internally to the classes
> and used, I have no options to do that sort of debug. Similarly,
> anything that is generated, it would be ideal if there was a way to
> re-use a previously generated artifact and not generate it on the fly
> .. that's the element that opens the door to version control and tracking.

Why do we need to track the generated file if the source is under
version control and the generated file is cached like any other task
output? I'm working on a prototype with the following steps:

1. Fetch the sources from the recipe (do_fetch)
2. Unpack the sources from the recipe (do_unpack)
3. Apply patches which are marked as early to patch the lock file
   (do_patch_early)
4. Resolve dependencies from the lock file and write them into a file
   (do_vendor_resolve)
5. Fetch dependencies (do_vendor_fetch)
6. Unpack dependencies into a package manager cache (do_vendor_unpack)
7. Create a vendor directory below the source folder (do_vendor)
8. Apply patches (do_patch)

The go, rust and npm fetchers work. The go vendor folder works. I'm
still working on the vendor directory for crate, a solution for npm
without JavaScript, and the integration of the dynamic sources into
the SBOM, archiver and so on.

Do you have a recommendation for an example project for the Rust, Go
and npm fetcher?

> We'll agree to disagree on what is or isn't efficient or complicated.
> Luckily, this is all opt-in, so I'll never really have to use it. I'm
> just sharing what it would take to get me to consider it based on what
> I've learned/suffered in my time maintaining quite a few go recipes.
My problem is understanding the reasons and use cases behind the inc
file for generated content and its version control. I understand that
it must be possible to manipulate the fetched dependencies, to cache
the generated fetcher URIs, to make the fetcher URIs viewable and to
manipulate the fetched dependency sources.

Regards
Stefan
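[Editorial illustration] The resolve stage described in the prototype steps above (do_vendor_resolve) could, for a Cargo.lock-style input, look roughly like this. This is an illustrative sketch, not code from the patch series; the function name, parsing approach and sample checksum values are all made up, while the crates.io download URL pattern follows the cover letter:

```python
# Hypothetical sketch: turn the entries of a Cargo.lock-style lock file
# into plain download URLs plus checksums (the crates.io URL pattern is
# the one listed in the cover letter; checksum values here are made up).
import re

LOCK_SNIPPET = """
[[package]]
name = "glob"
version = "0.3.1"
source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "d2fabcfbdc87f4758337ca535fb41a6d701b65693ce38287d856d1674551ec9b"

[[package]]
name = "libc"
version = "0.2.155"
source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "97b3888a4aecf77e811145cadf6eef5901f4782c53886191b2f693f24761847c"
"""

def resolve_cargo_lock(text):
    """Return (url, sha256) pairs for every crates.io package in the lock file."""
    urls = []
    for block in text.split("[[package]]"):
        # Collect the simple 'key = "value"' fields of one package block.
        fields = dict(re.findall(r'(\w+) = "([^"]*)"', block))
        if fields.get("source", "").startswith("registry+"):
            url = ("https://crates.io/api/v1/crates/%s/%s/download"
                   % (fields["name"], fields["version"]))
            urls.append((url, fields["checksum"]))
    return urls

for url, sha in resolve_cargo_lock(LOCK_SNIPPET):
    print(url)
```

In a bbclass-based prototype, the pairs returned here would be written to a file by do_vendor_resolve and consumed by do_vendor_fetch.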
On Fri, Jan 17, 2025 at 2:45 AM Stefan Herbrechtsmeier via lists.openembedded.org <stefan.herbrechtsmeier-oss= weidmueller.com@lists.openembedded.org> wrote: > Am 17.01.2025 um 05:19 schrieb Bruce Ashfield via lists.openembedded.org: > > On Mon, Jan 13, 2025 at 2:11 AM Stefan Herbrechtsmeier < > stefan.herbrechtsmeier-oss@weidmueller.com> wrote: > >> Am 10.01.2025 um 21:24 schrieb Bruce Ashfield: >> >> On Fri, Jan 10, 2025 at 10:04 AM Stefan Herbrechtsmeier via >> lists.openembedded.org <stefan.herbrechtsmeier-oss= >> weidmueller.com@lists.openembedded.org> wrote: >> >>> Am 10.01.2025 um 14:26 schrieb Alexander Kanavin: >>> >>> On Fri, 10 Jan 2025 at 12:32, Stefan Herbrechtsmeier<stefan.herbrechtsmeier-oss@weidmueller.com> <stefan.herbrechtsmeier-oss@weidmueller.com> wrote: >>> >>> What is your opinion regarding gitsm. Should we remove the bitbake fetcher and use a update task to generate a inc file with the source uris and source revisions? >>> >>> That ship has sailed. We can't remove gitsm, it has users, and they >>> will be very angry. >>> >>> This makes it impossible to fix wrong design decision or remove code >>> with a low code quality. >>> >>> Do you really review the changes of the inc file? >>> I understand the points but I have the feeling that they are more theoretically for package manager dependencies or could be solved in an other way (ex. caching) >>> >>> But do you? I have to restate the point: a solution that can be placed >>> inside a layer is much more scalable and maintainable than adding code >>> to bitbake. That's why I'm leaning towards drawing the line at >>> existing fetchers that are wget/git convenience wrappers, and shifting >>> dependency/lockfile management to layers. It's ultimately RP's call, >>> but he does seek feedback :) >>> >>> I'm working on it. >>> >>> I'm fine with large SRC_URI/sha256 diffs when recipes get updated to >>> new versions. 
And since you asked, no, no one looks at them, they're >>> auto-generated noise that we learned to block out, just as we learned >>> to quickly skim over recipe patch changes that are just line number >>> churn and similar non-functional changes. >>> >>> Instead of an inc file the generated SRC_URIs could be saved inside the >>> work directory of the recipe. This will eliminate the noise and avoid a >>> manual run of an update task after a recipe changes. >>> >> >> Except for those that want the .inc file changes to be version controlled >> (as well as SRC_URI changes), but maybe I'm misunderstanding what you >> described above >> >> Why should somebody version control the generated SRC_URI? >> >> >> Why wouldn't they ? I'm talking about when the SRC_URI is generated to > git fetches (or whatever), that is part of the recipe and version > controlled. > > My point is that this is not throw away / transient information for many > use cases. It is something that can be tracked between updates to the > recipes. > > > >> A generated temporary/build file is definitely more visible than >> something that is programmatically done and held internally during recipe >> processing and build. It opens the door for extension and doing version >> control on it. So I don't object to the concept, I just don't think I have >> all the details straight in my head. >> >> A generated build file will be saved in the work directory of the recipe >> like any other generated build file. It is impossible to add it to the >> version control system. The update task create a version controlled >> generated source file. I don't understand why the version control is needed >> because the source of the generator and the generator are version >> controlled. Especially if the output is ignored during patch review. I >> think it is much more straightforward to patch the source (lock file) >> because it is complicated to handle manual changes during regeneration of a >> generated file. >> > *sigh*. 
I'm quite aware of what can and cannot be done. That's not what I > meant. I'm obviously not talking about something in WORKDIR. I'm just > saying that if something is written to disk, then depending on how things > are implemented it can be viewed, debugged and manipulated. If it is always > generated, held internally to the classes and used, I have no options to do > that sort of debug. Similarly, anything that is generated, it would be > ideal if there was a way to re-use a previously generated artifact and not > generate it on the fly .. that's the element that opens the door to version > control and tracking. > > Why do we need to track the generated file if the source is version > control and the generated file is cached like any other task output. I > working on a prototype with the following steps: > > 1. Fetch the sources from the recipe (do_fetch) > 2. Unpack the sources from the recipe (do_unpack) > 2. Apply patches which are marked as early to patch the lock file > (do_patch_early) > 3. Resolve dependencies from the lock file and write it into a file > (do_vendor_resolve) > 4. Fetch dependencies (do_vendor_fetch) > 5. Unpack dependencies into a package manager cache (do_vendor_unpack) > 6. Create a vendor directory below the source folder (do_vendor) > 7. Apply patches (do_patch) > I just track the vendor resolution over time. I've used it many times to figure out what has gone wrong with the go recipes that I maintain when the upstream repositories have done something odd with tags, etc, when I'm doing recipe upgrades. I use that same file to bump SRCREVs on the vendor dependency fetches when picking upstream fixes, etc. because I'm typically working on dependencies that don't have upstream releases that contain what I need and rather than patch a vendor'd file, I just bump the individual dependency or point it somewhere else (typically local to my machine) to fix the problem. 
It's the workflow I've developed after needing to wade into very large go recipes that went to go mod fetched vendor directories quite early on and it ensured that I'm not relying on any proxies, infrastructure or much that is hidden, so I'm able to debug, archive and be relatively sure that I can keep things working over time. I'm not even remotely saying this workflow is for everyone, I'm just trying to see if I could use some of this to resolve those base fetches and be able to use the outputs of it (what I currently have in .inc files) as part of my recipes. The .inc files are the ones that have the fetches listed/resolved, and those are the ones that are part of my recipe, so they are version controlled along with the main recipe. Cheers, Bruce > The go, rust and npm fetchers work. The go vendor folder works. I'm still > working on the vendor directory for crate, a solution for npm without > JavaScript and the integration of the dynamic sources into the SBOM, > archiver and so on. > > Do you have a recommendation for an example project for the Rust, Go and > npm fetcher? > > We'll agree to disagree on what is or isn't efficient or complicated. > Luckily, this is all opt-in, so I'll never really have to use it. I'm just > sharing what it would take to get me to consider it based on what I've > learned/suffered in my time maintaining quite a few go recipes. > > My problem is to understand the reasons or use cases behind the inc for > generated content and its version control. I understand that is must be > possible to manipulate the fetches dependencies, to cache the generated > fetcher URIs, to make the fetcher URIs viewable and to manipulate the > fetched dependency sources. > > Regards > Stefan
From: Stefan Herbrechtsmeier <stefan.herbrechtsmeier@weidmueller.com>

The patch series improves the fetcher support for tightly coupled
package managers (npm, go and cargo). It adds support for embedded
dependency fetchers via a common dependency mixin. The patch series
reworks the npm-shrinkwrap.json (package-lock.json) support and adds a
fetcher for go.sum and cargo.lock files. The dependency mixin contains
two stages. The first stage locates a local specification file or
fetches an archive or git repository with a specification file. The
second stage resolves the dependency URLs from the specification file
and fetches the dependencies.

SRC_URI = "<type>://npm-shrinkwrap.json"
SRC_URI = "<type>+http://example.com/ npm-shrinkwrap.json"
SRC_URI = "<type>+http://example.com/${BP}.tar.gz;striplevel=1;subdir=${BP}"
SRC_URI = "<type>+git://example.com/${BPN}.git;protocol=https"

Additionally, the patch series reworks the npm fetcher to work without
an npm binary and an external package repository. It adds support for
a common dependency name and version schema to integrate the
dependencies into the SBOM.

= Background

Bitbake has diverse concepts and drawbacks for the different tightly
coupled package managers. The Python support uses a recipe per
dependency and generates common fetcher URLs via a python function.
The other languages embed the dependencies inside the recipe.

The Node.js support offers an npmsw fetcher which uses a lock file
beside the recipe to generate multiple common fetcher URLs on the fly
and thereby hides the real download sources. This leads, for example,
to a single source in the SBOM.

The Go support contains two parallel implementations: a vendor-based
solution with a common fetcher and a go-mod-based solution with a
gomod fetcher. The vendor-based solution includes the individual
dependencies in the SRC_URI of the recipe and uses a python function
to generate common fetcher URLs with additional information for the
vendor task. The gomod fetcher uses a proprietary gomod URL. It
translates the URL into a common URL and prepares metadata during
unpack.

The Rust support includes the individual dependencies in the SRC_URI
of the recipe and uses proprietary crate URLs. The crate fetcher
translates a proprietary URL into a common fetcher URL and prepares
metadata during unpack.

The recipetool does not support the crate and the gomod fetcher. This
leads to missing licenses for the dependencies in the recipe, for
example in librsvg.

The steps needed to fetch dependencies for Node.js, Go and Rust are
similar:

1. Extract the dependencies from a specification file (name, version,
   checksum and URL)
2. Generate proprietary fetcher URIs
   a. npm://registry.npmjs.org/;package=glob;version=10.3.15
   b. gomod://golang.org/x/net;version=v0.9.0
      gomodgit://golang.org/x/net;version=v0.9.0;repo=go.googlesource.com/net
   c. crate://crates.io/glob/0.3.1
3. Generate wget or git fetcher URIs
   a. https://registry.npmjs.org/glob/-/glob-10.3.15.tgz;downloadfilename=…
   b. https://proxy.golang.org/golang.org/x/net/@v/v0.9.0.zip;downloadfilename=…
      git://go.googlesource.com/net;protocol=https;subdir=…
   c. https://crates.io/api/v1/crates/glob/0.3.1/download;downloadfilename=…
4. Unpack
5. Create meta files
   a. Update lockfile and create tar.gz archives
   b. Create go.mod file
      Create info, go.mod file and zip archives
   c. Create .cargo-checksum.json files

It looks like the recipetool is not widely used and therefore this
patch series integrates the dependency resolution into the fetcher.
After an agreement on a concept the fetcher could be extended. The
fetcher could download the license information per package, and a new
build task could run the license cruncher from the recipetool.

= Open questions

* Where should we download dependencies?
** Should we use a folder per fetcher (ex. git and npm)?
** Should we use the main folder (ex. crate)?
** Should we translate the name into a folder (ex. gomod)?
** Should we integrate the name into the filename (ex. git)?
* Where should we unpack the dependencies?
** Should we use a folder inside the parent folder (ex. node_modules)?
** Should we use a fixed folder inside unpackdir (ex.
   go/pkg/mod/cache/download and cargo_home/bitbake)?
* How should we treat archives for package manager caches?
** Should we unpack the archives to support patching (ex. npm)?
** Should we copy the packed archive to avoid unpacking and packaging
   (ex. gomod)?

This patch series depends on patch series
20241209103158.20833-1-stefan.herbrechtsmeier-oss@weidmueller.com
("[1/4] tests: fetch: adapt npmsw tests to fixed unpack behavior").

Stefan Herbrechtsmeier (21):
  tests: fetch: update npmsw tests to new lockfile format
  fetch2: npmsw: remove old lockfile format support
  tests: fetch: replace [url] with urls for npm
  fetch2: do not prefix embedded checksums
  fetch2: read checksum from SRC_URI flag for npm
  fetch2: introduce common package manager metadata
  fetch2: add unpack support for npm archives
  utils: add Go mod h1 checksum support
  fetch2: add destdir to FetchData
  fetch: npm: rework
  tests: fetch: adapt style in npm(sw) class
  tests: fetch: move npmsw test cases into npmsw test class
  tests: fetch: adapt npm test cases
  fetch: add dependency mixin
  tests: fetch: add test cases for dependency fetcher
  fetch: npmsw: migrate to dependency mixin
  tests: fetch: adapt npmsw test cases
  fetch: add gosum fetcher
  tests: fetch: add test cases for gosum
  fetch: add cargolock fetcher
  tests: fetch: add test cases for cargolock

 lib/bb/fetch2/__init__.py   |  35 +-
 lib/bb/fetch2/cargolock.py  |  73 +++
 lib/bb/fetch2/dependency.py | 167 +++++++
 lib/bb/fetch2/gomod.py      |   5 +-
 lib/bb/fetch2/gosum.py      |  51 +++
 lib/bb/fetch2/npm.py        | 244 +++--
 lib/bb/fetch2/npmsw.py      | 347 ++++----------
 lib/bb/tests/fetch.py       | 880 +++++++++++++++++-------
 lib/bb/utils.py             |  25 +
 9 files changed, 916 insertions(+), 911 deletions(-)
 create mode 100644 lib/bb/fetch2/cargolock.py
 create mode 100644 lib/bb/fetch2/dependency.py
 create mode 100644 lib/bb/fetch2/gosum.py
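[Editorial illustration] The URL generation described in step 3 above can be sketched with a small helper. The URL patterns follow the examples in this cover letter; the function name is made up, and a real Go implementation would additionally need the module proxy's case escaping (uppercase letters encoded as "!" plus lowercase), which is omitted here:

```python
# Hypothetical sketch of step 3: map a resolved (ecosystem, name, version)
# triple to the plain download URL patterns listed in the cover letter.
def download_url(ecosystem, name, version):
    if ecosystem == "npm":
        # https://registry.npmjs.org/<name>/-/<base>-<version>.tgz
        base = name.split("/")[-1]  # strip a @scope/ prefix if present
        return "https://registry.npmjs.org/%s/-/%s-%s.tgz" % (name, base, version)
    if ecosystem == "go":
        # Go module proxy zip; NOTE: real code must also escape uppercase
        # letters in the module path per the module proxy protocol.
        return "https://proxy.golang.org/%s/@v/%s.zip" % (name, version)
    if ecosystem == "crate":
        return "https://crates.io/api/v1/crates/%s/%s/download" % (name, version)
    raise ValueError("unknown ecosystem: %s" % ecosystem)

print(download_url("npm", "glob", "10.3.15"))
print(download_url("go", "golang.org/x/net", "v0.9.0"))
print(download_url("crate", "glob", "0.3.1"))
```

A fetcher built on this would still append downloadfilename= parameters, as the step 3 examples show, to keep the download cache unambiguous.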