
Initial statically linked clang image #5


Merged: 35 commits, May 20, 2022
Conversation

nickdesaulniers (Member)

This image still contains object files built with GCC (from library
dependencies), but it's a starting point for rebuilding those dependencies
from scratch.

This is less a request for code review and more a check that my "tagging" in build.sh is correct (I'm not sure where to put version info, if any) and that the directory structure makes sense (llvm-project/ with multiple Dockerfiles for different stages?), as well as a heads-up that I plan to publish this on our Docker registry and then start rebuilding dependencies from scratch.

Link: ClangBuiltLinux/tc-build#150

@nickdesaulniers (Member Author)

The goal with this bootstrap is that it goes in the garbage ASAP; it's just enough to start building dependencies from source.

nathanchance (Member) left a comment

Just some initial comments, hopefully that helps clarify some things!

nickdesaulniers (Member Author) commented May 16, 2022

Pushed a few changes; not quite done yet. Happy to squash when merging, or to force-push a squashed branch.

nickdesaulniers added a commit that referenced this pull request May 17, 2022
docker produces the following error if the input file is unspecified
(via `-f Dockerfile`):

unable to prepare context: unable to evaluate symlinks in Dockerfile
path: lstat /android2/containers/linux/Dockerfile: no such file or
directory

Suggested by @nathanchance in
#5 (comment)
@nathanchance (Member)

Also, I just stumbled across the concept of apk add --virtual, which would make adding and removing dependencies a little easier/clearer:

https://reddit.com/r/archlinux/comments/ut383i/_/i97fgfk/?context=1

@nickdesaulniers (Member Author)

> Additionally, we could set up the workflow so that it only pushes the final tag, rather than the intermediate ones.

Right, we can export files between dependent workflows; we already do so for CI with the builds.json file. Is that what you were thinking?

> Is the plan to add more stages to llvm-project or will that live in a different folder?

So that's something I'm still not certain about. Initially, I was thinking we'd publish the results of this build, then immediately rewrite the Dockerfile to replace the bootstrapping with the newly published container.

But thinking about supporting other hosts (aarch64 in particular), I think we might want to keep the bootstrapping script alive a little longer, even if we use it only once to produce the initial image.

I don't have enough experience with Docker to know whether there are conventions for what we're trying to do. My initial preference is to use this repo's top-level directories as a namespace mirroring the open source projects we're building, so that would mean multiple Dockerfiles, or multiple stages, in this folder. We could also put things in subfolders under llvm-project/, which I'd be fine with too; perhaps llvm-project/stage1/Dockerfile, llvm-project/stage2/Dockerfile, llvm-project/stage3/Dockerfile. IDK.

The thing I still don't like about Docker is that multi-stage builds across multiple distinct Dockerfiles seem to require some orchestration via a shell script to name/tag the resulting containers. I want to be able to test the full chain locally (which is easier with one Dockerfile, IMO) without being able to push to the container registry (ideally only CI can push), but I also want CI to test the individual pieces as distinct workflows, which gives each piece more time to run and better expresses the dependency chain.

@nickdesaulniers (Member Author)

> Also, I just stumbled across the concept of apk add --virtual, which would make adding and removing dependencies a little easier/clearer:

How is that different from apk del?

@nathanchance (Member)

> Right, we can export files between dependent workflows. We do so for CI with the builds.json file. Is that what you were thinking?

Yes, exactly. I was thinking about having each container build be its own step, but I think we need to make them individual jobs so that we get the time benefits. The Docker folks have a nice example of doing this:

https://github.com/docker/build-push-action/blob/master/docs/advanced/share-image-jobs.md

> Perhaps llvm-project/stage1/Dockerfile, llvm-project/stage2/Dockerfile, llvm-project/stage3/Dockerfile

I think this organization makes sense, especially if we are talking about reusing certain parts later.

> The thing I still don't like about docker is multistage builds across multiple distinct Dockerfiles seems to require some orchestration via a shell script to name/tag the resulting container.

Right, I think we can get away with just having a script that handles this; we might want to move linux, llvm-project, and musl into a folder like toolchain or something, then have a build.sh that handles all the building.

$ tree toolchain
toolchain
├── build.sh
├── linux
├── llvm-project
└── musl

3 directories, 1 file
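A minimal sketch of what such an orchestration script might look like; the stage names, registry, and tags here are hypothetical, not the actual repo layout:

```shell
#!/bin/sh
# Hypothetical build.sh: builds each stage's Dockerfile in order and tags the
# result, so the chain can be tested locally with a single command.
set -eu

REGISTRY=example.registry/cbl   # placeholder registry namespace

for stage in stage1 stage2 stage3; do
  docker build \
    -f "llvm-project/${stage}/Dockerfile" \
    -t "${REGISTRY}/llvm-project:${stage}" \
    .
done
```

Pushing would stay out of this script entirely, so only CI (with registry credentials) can publish the resulting tags.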

@nathanchance (Member)

> Also, I just stumbled across the concept of apk add --virtual, which would make adding and removing dependencies a little easier/clearer:

> How is that different from apk del?

Just makes grouping and removing dependencies a little easier, as you could do something like:

$ apk add --virtual musl-deps make musl-dev rsync

$ apk del musl-deps

vs.

$ apk add make musl-dev rsync

$ apk del make musl-dev rsync
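In a Dockerfile, this pattern is typically kept inside a single RUN layer so the build-only packages never persist in the final image; the package names and build step below are illustrative:

```dockerfile
# Install build-only deps under one virtual name, build, then remove them all
# in the same layer so they never land in the image.
RUN apk add --no-cache --virtual musl-deps make musl-dev rsync && \
    make -C /src/musl install && \
    apk del musl-deps
```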

nathanchance (Member) left a comment

I think this is solid enough for now; I can nitpick style until the cows come home :)

compnerd (Member) left a comment

It would be nice to add the comments on the CMake cache up front, but the rest of it seems reasonable.

@nickdesaulniers (Member Author)

> The Docker folks have a nice example of doing this:
> https://github.com/docker/build-push-action/blob/master/docs/advanced/share-image-jobs.md

Interesting, I see how this works. So you can serialize/deserialize a Docker container to/from a tarball, and it's just the upload/download artifact pieces we're already using elsewhere. I see...
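For reference, the tarball round-trip in that linked example boils down to the following (image name hypothetical):

```shell
# Producing job: serialize the built image to a compressed tarball,
# then upload it with the usual artifact-upload step.
docker save myimage:stage1 | gzip > stage1.tar.gz

# Consuming job: download the artifact, then restore the image
# into the local Docker daemon.
docker load < stage1.tar.gz
```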

> Right, I think we can just get away with having a script that handles this; we might want to move the linux, llvm-project, and musl into a folder like toolchain or something then have a build.sh that we can run that handles doing all the building.

SGTM. Perhaps "sysroot", since we will need to distribute clang, lld, and other LLVM binaries, but also a few compiler-supplied headers that are required. At that point, throwing in the libc++ headers and libraries wouldn't hurt. Then we have a sysroot that can be the basis of a musl and libc++ based distro. ;)

I'm going to save changing directory structure to a follow up PR where we focus more on the CI wiring.

> Just makes grouping and removing dependencies a little easier, as you could do something like:

Ah, yeah, this seems a bit more descriptive. I wonder if you can repeat that command to "append" to the running list that a given identifier represents? I think I might want to install dependencies as late as possible ("just in time"), only when they are absolutely needed, but then clean everything up all at once. Though I do wonder if we could remove some dependencies as soon as they are no longer needed; that might be overkill.

> It would be nice to add the comments on the CMake cache up front, but the rest of it seems reasonable.

Sure. I think code review of those to check my decisions are sound would be helpful (even to clarify my own understanding). If we can't explain why a flag should be used, I feel it does not belong. Let me take the time to add some comments in this PR.

Thanks both for all of the good tips; I'm impressed with how much better this looks than my initial commit! :)


# Use libunwind from stage1.
# Statically link resulting executable.
# TODO: why is -lc++abi necessary for static link?
Member
Because static libraries do not carry DT_NEEDED and thus we cannot get the transitive closure for the symbols and will fail to link because lld behaves incorrectly (undefined symbols are encouraged by ELF).
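To illustrate the point above: because static archives carry no DT_NEEDED entries, libc++'s own dependencies must be named explicitly on the link line. The flags below are a sketch, not the exact Dockerfile invocation:

```shell
# Static archives record no dependency information, so libc++'s
# requirements (libc++abi, libunwind) have to be spelled out by hand.
clang++ -static -fuse-ld=lld -stdlib=libc++ main.cpp \
    -lc++abi -lunwind -o main
```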

Member Author
> because lld behaves incorrectly (undefined symbols are encouraged by ELF).

"Behaves incorrectly?" Does BFD do something differently/correctly?

Member
Yes, BFD does permit the undefined symbols IIRC. However, re-reading the lines below, the executable should be fully linked and not have undefined symbols - that does make sense (even though it may be different from the default expected semantics).

That said, I think this is more about expectations and Unix defaults ... it's just different, and "correctness" here refers to the fact that lld changes the default expectations, so its behaviour differs from the traditional linkers, which for a drop-in replacement is arguably incorrect.

Comment on lines +17 to +19
# TODO: passing in the value of $(clang -print-multiarch) causes failures.
# It seems that alpine clang's default target triple is x86_64-linux-gnu.
# Perhaps missing alpine in the triple causes some incompatibility?
nathanchance (Member) commented May 20, 2022

I don't think clang -print-multiarch prints the default target triple; it seems like we really want clang -print-target-triple?

On AArch64:

$ podman run --rm -ti docker.io/alpine:edge sh -c "apk add --no-cache clang 2>&1 >/dev/null && clang -print-target-triple"
aarch64-alpine-linux-musl

On x86_64:

$ podman run --rm -ti docker.io/alpine:edge sh -c "apk add --no-cache clang 2>&1 >/dev/null && clang -print-target-triple"
x86_64-alpine-linux-musl

Member Author
Ah, it makes sense that the flag for the target triple is named as such! My mistake for trying the wrong one. Let me give that a shot. Thanks for the tip.

Member Author
idk, I'm still running into issues trying to replace this. Will look further some other time.

Member
Are you trying to replace this within the cmake cache or in the Dockerfile? Happen to have a diff that you tried so I can see if I can reproduce locally?

Member Author
I'm trying to have cmake .... -D TRIPLE=$(clang -print-target-triple) in the Dockerfile, then set(LLVM_DEFAULT_TARGET_TRIPLE "${TRIPLE}") in the stageX.cmake.

Member
Ah yeah, I don't think that will work. I think we should ditch trying to preserve that in the cache and just define the value during the cmake invocation; it is not a static value, so we shouldn't try to treat it as such.

cmake -B … -DLLVM_DEFAULT_TARGET_TRIPLE=$(clang -print-target-triple) …

@nickdesaulniers nickdesaulniers merged commit 45106d7 into main May 20, 2022
@nickdesaulniers nickdesaulniers deleted the llvm-bootstrap branch May 20, 2022 21:54