Commit Graph

12684 Commits (bfa60e2d4e6615b0c39ef2c1507930167aeab299)
 

Author SHA1 Message Date
bunnei 680ab61327
Merge pull request #2786 from ReinUsesLisp/vote
shader_ir: Implement VOTE on Nvidia drivers
6 years ago
ReinUsesLisp e3534700d7 shader_ir/conversion: Split int and float selector and implement F2F H1 6 years ago
ReinUsesLisp b13fbc25b8 shader_ir/conversion: Implement F2I F16 Ra.H1 6 years ago
ReinUsesLisp 6207751b00 float_set_predicate: Add missing negation bit for the second operand 6 years ago
Weiyi Wang 0580112940 Guard unistd.h with MacOS only macro
Fix compile error on Windows caused by #4877
Weird, I thought I saw this guard during the code review...
6 years ago
B3n30 246b515a86 citra_qt: on osx chdir to bundle dir to allow detection of user folder 6 years ago
ReinUsesLisp 4e35177e23 shader_ir: Implement VOTE
Implement VOTE using Nvidia's intrinsics. Documentation about these can
be found here:
https://developer.nvidia.com/reading-between-threads-shader-intrinsics

Instead of using portable ARB instructions I opted to use Nvidia
intrinsics because these are the closest we have to how Tegra X1
hardware renders.

To stub VOTE on non-Nvidia drivers (including nouveau) this commit
simulates a GPU with a warp size of one, returning what is meaningful
for the instruction being emulated:

* anyThreadNV(value) -> value
* allThreadsNV(value) -> value
* allThreadsEqualNV(value) -> true

ballotARB, also known as "uint64_t(activeThreadsNV())", emits

VOTE.ANY Rd, PT, PT;

on nouveau's compiler. This doesn't exactly match Nvidia's code,

VOTE.ALL Rd, PT, PT;

which is emulated with activeThreadsNV() by this commit. In theory this
shouldn't really matter, since .ANY, .ALL and .EQ affect the predicates
(set to PT in those cases) and not the registers.
6 years ago
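As an illustration of the warp-size-one stub described in 4e35177e23, here is a minimal C++ sketch. It is not the code from the commit: the intrinsics named in the message are shader-side functions, whereas this is a host-side model that only mirrors the three mappings listed there. All function names (VoteAny, VoteAll, VoteAllEqual, CheckSingleThreadVote) are hypothetical.

```cpp
#include <cassert>

// Hypothetical host-side model of a warp that contains exactly one thread.
// With a single thread, "any" and "all" collapse to that thread's own value,
// and the thread trivially agrees with itself.
constexpr bool VoteAny(bool value) { return value; }          // anyThreadNV(value) -> value
constexpr bool VoteAll(bool value) { return value; }          // allThreadsNV(value) -> value
constexpr bool VoteAllEqual(bool /*value*/) { return true; }  // allThreadsEqualNV(value) -> true

// Sanity check of the three mappings above.
inline void CheckSingleThreadVote() {
    assert(VoteAny(true) && !VoteAny(false));
    assert(VoteAll(true) && !VoteAll(false));
    assert(VoteAllEqual(true) && VoteAllEqual(false));
}
```

The point of the fallback is that each result stays meaningful for a lone thread, so shaders that use VOTE still behave consistently on drivers without the Nvidia intrinsics.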
Fernando Sahmkow 83ec2091c1 Buffer Cache: Address Feedback. 6 years ago
Fernando Sahmkow 6ce2c85047 Buffer_Cache: Implement flushing. 6 years ago
Fernando Sahmkow de8ff8a1c6 Buffer_Cache: Implement barriers. 6 years ago
Fernando Sahmkow 286f4c446a Buffer_Cache: Optimize and track written areas. 6 years ago
Fernando Sahmkow 5f4b746a1e BufferCache: Rework mapping caching. 6 years ago
Fernando Sahmkow 86d8563314 Buffer_Cache: Fixes and optimizations. 6 years ago
Fernando Sahmkow 862bec001b Video_Core: Implement a new Buffer Cache 6 years ago
bunnei b4a8cfbd00
Merge pull request #2748 from FernandoS27/align-memory
VM_Manager: Align allocated host physical memory to 256 bytes
6 years ago
bunnei d654b3d82e
Merge pull request #2769 from FernandoS27/commands-flush
GPU: Flush commands on every dma pusher step.
6 years ago
bunnei dfdd20142e
Merge pull request #2777 from ReinUsesLisp/hsetp2-fe3h-fix
half_set_predicate: Fix HSETP2_C constant buffer offset
6 years ago
bunnei cedc1aab4a
Merge pull request #2753 from FernandoS27/float-convert
Shader_Ir: Implement F16 Variants of F2F, F2I, I2F.
6 years ago
bunnei 74a7ce1df7
Merge pull request #2773 from lioncash/test-unused
yuzu-tester/yuzu: Remove unused variable
6 years ago
ReinUsesLisp 80702aa88f renderer_opengl: Implement RGB565 framebuffer format 6 years ago
ReinUsesLisp 9cdf5c6c31 renderer_opengl: Use block linear swizzling for CPU framebuffers 6 years ago
ReinUsesLisp 8ad7268c75 renderer_opengl: Use VideoCore pixel format 6 years ago
ReinUsesLisp 9a76e94b3d gpu: Change optional<reference_wrapper<T>> to T* for FramebufferConfig 6 years ago
bunnei ef584f1a3a
Merge pull request #2747 from lioncash/audio
service/audren_u: Unstub ListAudioDeviceName
6 years ago
bunnei ca61e298b3
Merge pull request #2778 from ReinUsesLisp/nop
shader_ir: Implement NOP
6 years ago
bunnei 87bbefe55f
Merge pull request #2768 from ReinUsesLisp/hsetp2-fix
decode/half_set_predicate: Fix predicates
6 years ago
James Rowe 93abe1ccf3
Merge pull request #2789 from jroweboy/quickfix
Fixup! #2772 missed this one file
6 years ago
James Rowe 509734d818 Fixup! #2772 missed this one file 6 years ago
James Rowe e2392fe46f
Merge pull request #2766 from FearlessTobi/port-4849
Port citra-emu/citra#4849: "Qt: Fixed behaviour of buttons by connecting functors to correct signals"
6 years ago
James Rowe 0e9e166d85
Merge pull request #2772 from lioncash/ui
yuzu/CMakeLists: Remove qt5_wrap_ui macro usage
6 years ago
Lioncash 5980aa1e51 yuzu/CMakeLists: Remove qt5_wrap_ui macro usage
We can simply enable CMAKE_AUTOUIC and let CMake take care of handling
the UI code generation for targets.

As part of letting CMake automatically handle the header file parsing,
we must not name includes with "ui_*" unless they're related to the
output of the Qt UIC compiler. Because of this, we need to rename
ui_settings, given it would conflict with this restriction.
6 years ago
ReinUsesLisp 2ff8044806 shader_ir: Implement NOP 6 years ago
ReinUsesLisp ec0da3ef64 half_set_predicate: Fix HSETP2_C constant buffer offset 6 years ago
Silent 221250d922 Qt: Fixed behaviour of buttons by connecting functors to correct signals
The following screens were fixed:
- Configure/Debug
- Configure/Input
6 years ago
Flame Sage 978f7067ee
Merge pull request #2770 from DarkLordZach/azure-pr-fix
ci: Fix Azure PR Builds
6 years ago
Zach Hilman 9aef7e5e22 Correct apt permissions 6 years ago
Zach Hilman 6b2937bf76 Upgrade PIP version with APT 6 years ago
Zach Hilman a2d2a6b6dd Upgrade pip version 6 years ago
Zach Hilman d3ea2df06d Add missing dot 6 years ago
Lioncash 6e11cfcdf0 yuzu-tester/yuzu: Correct format string
Prevents an invalid formatting exception from being thrown.
6 years ago
Lioncash a0ee10b114 yuzu-tester/yuzu: Remove unused variable
Gets rid of a compilation warning.
6 years ago
Zach Hilman bcbec6f37c ci: Fix Azure PR Builds 6 years ago
Fernando Sahmkow e52c895559 GPU: Flush commands on every dma pusher step.
This commit ensures that the host gpu is constantly fed with commands to
work with, while the guest gpu keeps producing the rest of the commands.
This reduces syncing time between host and guest gpu.
6 years ago
bunnei 52f54c728d
Merge pull request #2592 from FernandoS27/sync1
Implement GPU Synchronization Mechanisms & Correct NVFlinger
6 years ago
ReinUsesLisp 77f1a676a1 decode/half_set_predicate: Fix predicates 6 years ago
Fernando Sahmkow a452ff983d MaxwellDMA: Fixes, corrections and relaxations.
This commit fixes offsets on linear -> tiled copies, corrects the z position
for tiled -> linear copies, corrects the bytes_per_pixel calculation in tiled
-> linear copies, and relaxes some limitations set by the latest DMA
fixes/refactors.
6 years ago
bunnei b0ff3179ef
Merge pull request #2739 from lioncash/cflow
video_core/control_flow: Minor changes/warning cleanup
6 years ago
bunnei 4d26550f5f
Merge pull request #2737 from FernandoS27/track-fix
Shader_Ir: Correct tracking to track from right to left
6 years ago
bunnei ccbc554949
Merge pull request #2689 from lioncash/tl
yuzu/main: Make error messages within OnCoreError more localization-friendly
6 years ago
bunnei 31e8a61527
Merge pull request #2743 from FernandoS27/surpress-assert
Downgrade and suppress a series of GPU asserts and debug messages.
6 years ago