Trending

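See what the GitHub community is most excited about today. Each entry gives the repository description, followed by its language, star count, fork count, and top contributors.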

  1. Code and data for the paper "Deep Painterly Harmonization": https://arxiv.org/abs/1804.03189

    Cuda 3,961 351 Built by @luanfujun @2m
  2. Fast parallel CTC.

    Cuda 3,310 915 Built by @ekelsen @jaredcasper @gangliao @bryancatanzaro @dxyzab
  3. Fully Convolutional Instance-aware Semantic Segmentation

    Cuda 1,279 373 Built by @Oh233 @daijifeng001 @liyi14 @YuwenXiong @ancientmooner
  4. GPU database engine

    Cuda 1,120 111 Built by @antonmks @hurdad @Randolph42 @AlexeyAB @pinkdevelops
  5. Squeeze-and-Excitation Networks

    Cuda 1,052 339 Built by @hujie-frank @lishen-shirley @GangSunLion @kambarakun
  6. MatConvNet: CNNs for MATLAB

    Cuda 1,034 669 Built by @vedaldi @lenck @jotaf98 @ankush-me @albanie
  7. Introduction to Parallel Programming class code

    Cuda 708 888 Built by @msarahan @chenghanlee @chenghan @cpowell @has207
  8. Optimized primitives for collective multi-GPU communication

    Cuda 616 187 Built by @sjeaugey @borisfom @nluehr @kylefernandes @lukeyeager
  9. Automatically exported from code.google.com/p/cuda-convnet2

    Cuda 590 246 Built by @akrizhevsky @bestimage-tencent
  10. A GPU implementation of Convolutional Neural Nets in C++

    Cuda 491 234 Built by @nitishsrivastava @avdmitry @rgrosse @itgod @neocortex14
  11. Fast, GPU-based CSV parser

    Cuda 475 27 Built by @antonmks @hurdad @eklitzke
  12. Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches

    Cuda 468 182 Built by @jzbontar
  13. CUB is a flexible library of cooperative threadblock primitives and other utilities for CUDA kernel programming.

    Cuda 447 135 Built by @dumerrill @elehcim @ebrevdo @lukeyeager
  14. High-Performance Graph Primitives on GPUs

    Cuda 355 94 Built by @yzhwang @sgpyc @1duo @jowens @neoblizz
  15. Reference implementation of real-time autoregressive WaveNet inference

    Cuda 344 50 Built by @BrianPharris @PetrochukM @pfriesch
  16. Efficient GPU kernels for block-sparse matrix multiplication and convolution

    Cuda 306 65 Built by @scott-gray @dchichkov @jonasschneider @openai-sys-okta-integration @scottconrad
  17. A CUDA backend for Torch7

    Cuda 282 187 Built by @soumith @killeent @dominikgrewe @colesbury @nicholas-leonard
  18. Code release for "Convolutional Two-Stream Network Fusion for Video Action Recognition", CVPR 2016.

    Cuda 280 115 Built by @feichtenhofer @abursuc
  19. A personal depthwise convolution layer implementation for Caffe by liuhao (GPU only).

    Cuda 256 106 Built by @yonghenglh6
  20. PyTorch implementation of Deformable Convolution

    Cuda 246 33 Built by @1zb
  21. Facebook's CUDA extensions.

    Cuda 236 55 Built by @nicolasvasilache @wickedfoo @ajtulloch @soumith @colesbury
  22. A CUDA implementation of SIFT for NVIDIA GPUs (2.6 ms on a GTX 1060)

    Cuda 234 125 Built by @Celebrandil @Helios-vmg
  23. My fork of Alex Krizhevsky's cuda-convnet from 2013 where I added dropout, among other features.

    Cuda 225 142 Built by @kashif @dnouri @yjxiong
  24. DeepSpeech neon implementation

    Cuda 209 66 Built by @tyler-nervana @tsocha @Neuroschemata @indie @F0REacH
  25. Source code that accompanies The CUDA Handbook.

    Cuda 199 105 Built by @ArchaeaSoftware @tycho @cdwfs