serenity

mirror of https://github.com/RGBCube/serenity synced 2025-07-08 22:37:35 +00:00

Author	SHA1	Message	Date
Andreas Kling	1dc9fa9506	Kernel: Simplify PageDirectory swapping in sys$execve() Swap out both the PageDirectory and the Region list at the same time, instead of doing the Region list slightly later.	2020-01-19 16:05:42 +01:00
Andreas Kling	05836757c6	Kernel: Oops, fix bad sort order of available VM ranges This made the allocator perform worse, so here's another second off of the Kernel/Process.cpp compile time from a simple bugfix! (31s to 30s)	2020-01-19 15:53:43 +01:00
Andreas Kling	167b57a6b7	TmpFS: Grow the underlying inode buffer with 2x factor when written to Before this, we would end up in memcpy() churn hell when a program was doing repeated write() calls to a file in /tmp. An even better solution will be to only grow the VM allocation of the underlying buffer and keep using the same physical pages. This would eliminate all the memcpy() work. I've benchmarked this using g++ to compile Kernel/Process.cpp. With these changes, compilation goes from ~35 sec to ~31 sec. :^)	2020-01-19 14:01:32 +01:00
Andreas Kling	1d02ac35fc	Kernel: Limit Thread::raw_backtrace() to the max profiler stack size Let's avoid walking overly long stacks here, since kmalloc() is finite.	2020-01-19 13:54:09 +01:00
Andreas Kling	6eab7b398d	Kernel: Make ProcessPagingScope restore CR3 properly Instead of restoring CR3 to the current process's paging scope when a ProcessPagingScope goes out of scope, we now restore exactly whatever the CR3 value was when we created the ProcessPagingScope. This fixes breakage in situations where a process ends up with nested ProcessPagingScopes. This was making profiling very fragile, and with this change it's now possible to profile g++! :^)	2020-01-19 13:44:53 +01:00
Andreas Kling	ad3f931707	Kernel: Optimize VM range deallocation a bit Previously, when deallocating a range of VM, we would sort and merge the range list. This was quite slow for large processes. This patch optimizes VM deallocation in the following ways: - Use binary search instead of linear scan to find the place to insert the deallocated range. - Insert at the right place immediately, removing the need to sort. - Merge the inserted range with any adjacent range(s) in-line instead of doing a separate merge pass into a list copy. - Add Traits<Range> to inform Vector that Range objects are trivial and can be moved using memmove(). I've also added an assertion that deallocated ranges are actually part of the RangeAllocator's initial address range. I've benchmarked this using g++ to compile Kernel/Process.cpp. With these changes, compilation goes from ~41 sec to ~35 sec.	2020-01-19 13:29:59 +01:00
Andreas Kling	87583aea9c	Kernel: Use copy_from_user() when appropriate during thread backtracing	2020-01-19 10:33:26 +01:00
Andreas Kling	38fc31ff11	Kernel: Always switch to own page tables when crashing/asserting I noticed this while debugging a crash in backtrace generation. If a process would crash while temporarily inspecting another process's address space, the crashing thread would still use the other process's page tables while handling the crash, causing all kinds of confusion when trying to walk the stack of the crashing thread.	2020-01-19 10:33:17 +01:00
Andreas Kling	f7b394e9a1	Kernel: Assert that copy_to/from_user() are called with user addresses This will panic the kernel immediately if these functions are misused so we can catch it and fix the misuse. This patch fixes a couple of misuses: - create_signal_trampolines() writes to a user-accessible page above the 3GB address mark. We should really get rid of this page but that's a whole other thing. - CoW faults need to use copy_from_user rather than copy_to_user since it's the source pointer that points to user memory. - Inode faults need to use memcpy rather than copy_to_user since we're copying a kernel stack buffer into a quickmapped page. This should make the copy_to/from_user() functions slightly less useful for exploitation. Before this, they were essentially just glorified memcpy() with SMAP disabled. :^)	2020-01-19 09:18:55 +01:00
Andreas Kling	2cd212e5df	Kernel: Let's say that everything < 3GB is user virtual memory Technically the bottom 2MB is still identity-mapped for the kernel and not made available to userspace at all, but for simplicity's sake we can just ignore that and make "address < 0xc0000000" the canonical check for user/kernel.	2020-01-19 08:58:33 +01:00
Andreas Kling	5ce9382e98	Kernel: Only require "stdio" pledge for sending signals to self This should match what OpenBSD does. Sending a signal to yourself seems basically harmless.	2020-01-19 08:50:55 +01:00
Sergey Bugaev	3e1ed38d4b	Kernel: Do not return ENOENT for unresolved symbols ENOENT means "no such file or directory", not "no such symbol". Return EINVAL instead, as we already do in other cases.	2020-01-18 23:51:22 +01:00
Sergey Bugaev	d0d13e2bf5	Kernel: Move setting file flags and r/w mode to VFS::open() Previously, VFS::open() would only use the passed flags for permission checking purposes, and Process::sys$open() would set them on the created FileDescription explicitly. Now, they should be set by VFS::open() on any files being opened, including files that the kernel opens internally. This also lets us get rid of the explicit check for whether or not the returned FileDescription was a preopen fd, and in fact, fixes a bug where a read-only preopen fd without any other flags would be considered freshly opened (due to O_RDONLY being indistinguishable from 0) and granted a new set of flags.	2020-01-18 23:51:22 +01:00
Sergey Bugaev	544b8286da	Kernel: Do not open stdio fds for kernel processes Kernel processes just do not need them. This also avoids touching the file (sub)system early in the boot process when initializing the colonel process.	2020-01-18 23:51:22 +01:00
Sergey Bugaev	6466c3d750	Kernel: Pass correct permission flags when opening files Right now, permission flags passed to VFS::open() are effectively ignored, but that is going to change. * O_RDONLY is 0, but it's still nicer to pass it explicitly * POSIX says that binding a Unix socket to a symlink shall fail with EADDRINUSE	2020-01-18 23:51:22 +01:00
Sergey Bugaev	7d4a267504	Kernel: Fix identifier casing	2020-01-18 23:51:22 +01:00
Andreas Kling	862b3ccb4e	Kernel: Enforce W^X between sys$mmap() and sys$execve() It's now an error to sys$mmap() a file as writable if it's currently mapped executable by anyone else. It's also an error to sys$execve() a file that's currently mapped writable by anyone else. This fixes a race condition vulnerability where one program could make modifications to an executable while another process was in the kernel, in the middle of exec'ing the same executable. Test: Kernel/elf-execve-mmap-race.cpp	2020-01-18 23:40:12 +01:00
Andreas Kling	4e6fe3c14b	Kernel: Symbolicate kernel EIP on process crash Process::crash() was assuming that EIP was always inside the ELF binary of the program, but it could also be in the kernel.	2020-01-18 14:38:39 +01:00
Andreas Kling	9c9fe62a4b	Kernel: Validate the requested range in allocate_region_with_vmobject()	2020-01-18 14:37:22 +01:00
Andreas Kling	aa63de53bd	Kernel: Use get_syscall_path_argument() in sys$execve() Paths passed to sys$execve() should certainly be subject to all the usual path validation checks.	2020-01-18 11:43:28 +01:00
Andreas Kling	b65572b3fe	Kernel: Disallow mmap names longer than PATH_MAX	2020-01-18 11:34:53 +01:00
Andreas Kling	c3e4387c57	Kernel: Stop flushing GDT/IDT registers all the time	2020-01-18 11:10:44 +01:00
Andreas Kling	17d4e74518	Kernel: Clean up and reorganize init.cpp This is where we first enter into the kernel, so we should make at least some effort to keep things nice and understandable.	2020-01-18 10:24:57 +01:00
Andreas Kling	6fea316611	Kernel: Move all CPU feature initialization into cpu_setup() ..and do it very very early in boot.	2020-01-18 10:11:29 +01:00
Andreas Kling	210adaeca6	ACPI: Re-enable ACPI initialization after paging changes	2020-01-18 10:03:14 +01:00
Andreas Kling	22d4920cef	RTL8139: Unbreak RealTek Ethernet driver after paging changes	2020-01-18 09:52:40 +01:00
Andreas Kling	94ca55cefd	Meta: Add license header to source files As suggested by Joshua, this commit adds the 2-clause BSD license as a comment block to the top of every source file. For the first pass, I've just added myself for simplicity. I encourage everyone to add themselves as copyright holders of any file they've added or modified in some significant way. If I've added myself in error somewhere, feel free to replace it with the appropriate copyright holder instead. Going forward, all new source files should include a license header.	2020-01-18 09:45:54 +01:00
Andreas Kling	19c31d1617	Kernel: Always dump kernel regions when dumping process regions	2020-01-18 08:57:18 +01:00
Andreas Kling	345f92d5ac	Kernel: Remove two unused MemoryManager functions	2020-01-18 08:57:18 +01:00
Andreas Kling	3e8b60c618	Kernel: Clean up MemoryManager initialization a bit more Move the CPU feature enabling to functions in Arch/i386/CPU.cpp.	2020-01-18 00:28:16 +01:00
Andreas Kling	a850a89c1b	Kernel: Add a random offset to the base of the per-process VM allocator This is not ASLR, but it does de-trivialize exploiting the ELF loader which would previously always parse executables at 0x01001000 in every single exec(). I've taken advantage of this multiple times in my own toy exploits and it's starting to feel cheesy. :^)	2020-01-17 23:29:54 +01:00
Andreas Kling	536c0ff3ee	Kernel: Only clone the bottom 2MB of mappings from kernel to processes	2020-01-17 22:34:36 +01:00
Andreas Kling	122c76d7fa	Kernel: Don't allocate per-process PDPT from super pages either The default system is now down to 3 super pages allocated on boot. :^)	2020-01-17 22:34:36 +01:00
Andreas Kling	ad1f79fb4a	Kernel: Stop allocating page tables from the super pages pool We now use the regular "user" physical pages for on-demand page table allocations. This was by far the biggest source of super physical page exhaustion, so that bug should be a thing of the past now. :^) We still have super pages, but they are barely used. They remain useful for code that requires memory with a low physical address. Fixes #1000.	2020-01-17 22:34:36 +01:00
Andreas Kling	f71fc88393	Kernel: Re-enable protection of the kernel image in memory	2020-01-17 22:34:36 +01:00
Andreas Kling	59b584d983	Kernel: Tidy up the lowest part of the address space After MemoryManager initialization, we now only leave the lowest 1MB of memory identity-mapped. The very first (null) page is not present. All other pages are RW but not X. Supervisor only.	2020-01-17 22:34:36 +01:00
Andreas Kling	545ec578b3	Kernel: Tidy up the types imported from boot.S a little bit	2020-01-17 22:34:36 +01:00
Andreas Kling	7e6f0efe7c	Kernel: Move Multiboot memory map parsing to its own function	2020-01-17 22:34:36 +01:00
Andreas Kling	ba8275a48e	Kernel: Clean up ensure_pte()	2020-01-17 22:34:36 +01:00
Andreas Kling	e362b56b4f	Kernel: Move kernel above the 3GB virtual address mark The kernel and its static data structures are no longer identity-mapped in the bottom 8MB of the address space, but instead move above 3GB. The first 8MB above 3GB are pseudo-identity-mapped to the bottom 8MB of the physical address space. But things don't have to stay this way! Thanks to Jesse who made an earlier attempt at this, it was really easy to get device drivers working once the page tables were in place! :^) Fixes #734.	2020-01-17 22:34:26 +01:00
Sergey Bugaev	4417bd97d7	Kernel: Misc tweaks	2020-01-17 21:49:58 +01:00
Sergey Bugaev	064cd2278c	Kernel: Remove the use of FileSystemPath in sys$realpath() Now that VFS::resolve_path() canonicalizes paths automatically, we don't need to do that here anymore.	2020-01-17 21:49:58 +01:00
Sergey Bugaev	68aeefa49b	ProcFS: Implement symlink magic	2020-01-17 21:49:58 +01:00
Sergey Bugaev	8642a7046c	Kernel: Let inodes provide pre-open file descriptions Some magical inodes, such as /proc/pid/fd/fileno, are going to want to open() to a custom FileDescription, so add a hook for that.	2020-01-17 21:49:58 +01:00
Sergey Bugaev	ae64fd1b27	Kernel: Let symlinks resolve themselves Symlink resolution is now a virtual method on an inode, Inode::resolve_as_symlink(). The default implementation just reads the stored inode contents, treats them as a path and calls through to VFS::resolve_path(). This will let us support other, magical files that appear to be plain old symlinks but resolve to something else. This is particularly useful for ProcFS.	2020-01-17 21:49:58 +01:00
Sergey Bugaev	e0013a6b4c	Kernel+LibC: Unify sys$open() and sys$openat() The syscall is now called sys$open(), but it behaves like the old sys$openat(). In userspace, open_with_path_length() is made a wrapper over openat_with_path_length().	2020-01-17 21:49:58 +01:00
Sergey Bugaev	d6184afcae	Kernel: Simplify VFS::resolve_path() further It turns out we don't even need to store the whole custody chain, as we only ever access its last element. So we can just store one custody. This also fixes a performance FIXME :^) Also, rename parent_custody to out_parent.	2020-01-17 21:49:58 +01:00
Andreas Kling	4d4d5e1c07	Kernel: Drop futex queues/state on exec() This state is not meaningful to the new process image so just drop it.	2020-01-17 16:08:00 +01:00
Andreas Kling	26a31c7efb	Kernel: Add "accept" pledge promise for accepting incoming connections This patch adds a new "accept" promise that allows you to call accept() on an already listening socket. This lets programs set up a socket for listening and then drop "inet" and/or "unix" so that only incoming (and existing) connections are allowed from that point on. No new outgoing connections or listening server sockets can be created. In addition to accept() it also allows getsockopt() with SOL_SOCKET and SO_PEERCRED, which is used to find the PID/UID/GID of the socket peer. This is used by our IPC library when creating shared buffers that should only be accessible to a specific peer process. This allows us to drop "unix" in WindowServer and LookupServer. :^) It also makes the debugging/introspection RPC sockets in CEventLoop based programs work again.	2020-01-17 11:19:06 +01:00
Andreas Kling	a9b24ebbe8	Kernel: Reindent linker script	2020-01-17 11:07:02 +01:00
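
The deallocation strategy described in ad3f931707 (binary search for the insertion point, insert in sorted position, merge with adjacent ranges in place) can be illustrated with a minimal standalone sketch. This is not the kernel's RangeAllocator code: the `Range` struct and the `deallocate()` function below are hypothetical stand-ins, and `std::vector` is used in place of the kernel's own Vector.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a VM range covering [base, base + size).
struct Range {
    std::uintptr_t base;
    std::size_t size;
    std::uintptr_t end() const { return base + size; }
};

// Returns `freed` to `available`, which must be sorted by base and contain
// no overlapping ranges. Adjacent ranges are merged in place, so the list
// never needs a separate sort or merge pass.
void deallocate(std::vector<Range>& available, Range freed)
{
    // Binary search for the first range whose base is >= freed.base.
    auto it = std::lower_bound(available.begin(), available.end(), freed,
        [](const Range& a, const Range& b) { return a.base < b.base; });

    // Merge into the predecessor if it ends exactly where `freed` begins.
    if (it != available.begin()) {
        auto prev = it - 1;
        if (prev->end() == freed.base) {
            prev->size += freed.size;
            // The grown predecessor may now also touch its successor.
            if (it != available.end() && prev->end() == it->base) {
                prev->size += it->size;
                available.erase(it);
            }
            return;
        }
    }

    // Merge into the successor if `freed` ends exactly where it begins.
    if (it != available.end() && freed.end() == it->base) {
        it->base = freed.base;
        it->size += freed.size;
        return;
    }

    // No adjacent neighbour: insert at the sorted position.
    available.insert(it, freed);
}
```

For example, with available ranges at 0x1000 and 0x4000 (each 0x1000 bytes), freeing 0x2000 grows the first range, and then freeing 0x3000 collapses the list into a single range starting at 0x1000.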

3011 commits