
CUDA Programming Courses

GPU/CUDA Programming for High Performance Computing
(in Mandarin, Spring 2013)
Total number of lectures: 18 (3 hours per week)
Programming Assignments: 4

This course is concerned with programming GPUs for general-purpose high performance computing (not for graphics). GPUs have evolved from supporting graphics to providing a computing engine for high performance computing. The world’s fastest compute system, the Tianhe‐1A, achieves its performance (2.507 Petaflops) through the use of 7000 GPUs. Many clusters and computer systems are being designed to incorporate GPUs into their compute nodes to achieve orders of magnitude speed improvements. In this course, we will learn how to program such systems. The platform can be either a Windows or a Linux system: we will learn how to use Windows systems in a departmental computing lab that have GPUs and appropriate software installed, and also a departmental Linux server that has a high performance 100‐core GPU installed. Tentative topics will include:

–History of GPUs leading to their use and design for HPC
–Introduction to the GPU programming model and CUDA, host and device memories
–Basic CUDA program structure, kernel calls, threads, blocks, grid, thread addressing, predefined variables, example code: vector and matrix addition, matrix multiplication
–Using Windows and Linux environments to compile and execute simple CUDA programs.
–Timing execution time
–Host synchronization
–Routines called from device.
–Incorporating graphical output.
–Global barrier synchronization.
–Coalesced global memory access
–Shared memory and constant memory usage
–Critical sections and atomics. Example use: counter and histogram programs
–CUDA streams
–Pinned memory, zero copy memory, multiple GPUs, portable pinned memory
–Optimizing performance using knowledge of warps and other characteristics of GPUs, overlapping computations, effects of control flow
–Parallel algorithms suitable for GPUs, parallel sorting
–Building complex applications, debugging tools
–Hybrid programming incorporating OpenMP and/or MPI with CUDA, GPU clusters, distributed clusters, …
–Possible advanced materials: texture memory, using GPU also for graphics

Parallel programming
(in English, Autumn 2013)
Total number of lectures: 18 (3 hours per week)
Number of assignments: 2

This course is planned and developed for graduate students. As multicore CPUs and many-core GPUs become ever more popular, parallel computing platforms are becoming easier to find every day. This course intends to cover multicore CPU and CUDA architectures, and will introduce OpenMP, MPI, CUDA and OpenCL with examples. Students will be given opportunities to acquire hands-on programming experience. NVIDIA CUDA and OpenCL will be used to learn GPU programming on NVIDIA and ATI GPUs, and OpenMP and MPI to explore the computational power of multicore CPU clusters. Tentative topics will include:

–Study multicore CPU and GPU architectures
–Study network topologies
–Learn how to write parallel programs using OpenMP, MPI, OpenCL and CUDA
–Study the issues that influence the speedup and efficiency of parallel programs
–Study some parallel algorithms, such as sorting, image processing, graphs, and numerical computation

1) Barry Wilkinson and Michael Allen, “Parallel Programming: Techniques and Applications Using Networked Workstations and Parallel Computers”, 2nd Edition, Prentice Hall
2) Michael J. Quinn, “Parallel Programming in C with MPI and OpenMP”, McGraw-Hill
3) Jason Sanders and Edward Kandrot, “CUDA by Example: An Introduction to General-Purpose GPU Programming”, Addison-Wesley Professional, 2010
4) David B. Kirk and Wen‐mei W. Hwu, “Programming Massively Parallel Processors: A Hands‐on Approach”, Morgan Kaufmann, 2010
5) Wen‐mei W. Hwu (Editor in Chief), “GPU Computing Gems Emerald Edition”, Morgan Kaufmann, 2011

Special Track: FLOSS for Safety Critical Systems — Embedded World Exhibition and Conference

This special track of the Embedded World Conference on Thursday, March 3, 2011, is organized by OSADL’s Safety Coordinator Prof. Nicholas McGuire and will focus on the use of Free and Open Source Software for safety critical systems.

Call for Papers – Abstract Submission – Submitted Papers

CfP at:

Dr. Markus Freiburghaus gave us a presentation

Dr. Markus Freiburghaus from the University of Applied Sciences Northwestern Switzerland gave a presentation about the education system of Switzerland on Oct 14th, 2010. The title of his presentation was ‘Stability and innovation: antagonism or cause-and-effect’.



Engineers from ETHERNET POWERLINK Association China visit DSLab, SISE

Two engineers from ETHERNET POWERLINK Association China visited DSLab on 18 November 2009. The engineers introduced the POWERLINK protocol and its use in industries such as printing. Prof. Nicholas McGuire gave a presentation on industrial safety and security. Dr. Zhou introduced the work and projects of DSLab. Student Li Chanjuan gave a presentation on the Node Ordering Protocol, a new real-time communication protocol designed by DSLab. We exchanged ideas in a warm atmosphere throughout the day.

XM-eRTL-4.0 release candidate 1 (rc1) has been announced

Announcement of XM-eRTL-4.0 release candidate 1 (rc1)
The eRTL release XM-eRTL-4.0, a replacement for the legacy RTLinux/GPL 3.2, is now ready for download as release candidate 1 (XM-eRTL-4.0-rc1).
XM/eRTL, based on the hypervisor XtratuM 1 as well as PaRTiKLe developed at the Universitat Politecnica de Valencia, is now being continued by DSLab at Lanzhou University and has been extended to be a full-featured replacement for RTLinux/GPL.
The wiki of XM/eRTL is

The development tree has been moved to git and is publicly available at:

You can also use:

git clone

Appropriate infrastructure to allow community interaction, patch submission, and repository access is being set up.

The DSLab team will continue to develop and enhance XM-eRTL in the future in tight coordination with the Universitat Politecnica de Valencia, DISCA, based on strong POSIX binding and compatibility with the vanilla Linux kernel as its root domain.
Enhancements in this first release candidate include:

XM-FIFO : fifo communication extension between RT and non-RT domains

XM-SHM  : shared memory module

XM-TRACE: a runtime tracer for XtratuM core and RT domains

XM-DEV  : XtratuM device driver domain

Modules under Development:

XM-PPC  : though still in the test phase, XM-eRTL-4.0 is in the alpha stage on PowerPC 440 and 405.

XM-MIPS : still in an early development stage, XM-eRTL-4.0 is targeting support for the Loongson MIPS processors (2F)

The DSLab XtratuM team.

L4eRTL 0.91 and L4eRTL 0.92 have been announced

L4eRTL 0.91 and L4eRTL 0.92 have been announced. They provide a POSIX interface and have good real-time performance.

L4eRTL is a real-time virtualization solution based on the L4/Fiasco microkernel. It allows a hard real-time operating system and a soft real-time operating system to coexist.

Congratulations to four students from DSLab on receiving M.S. degrees in the computer field

Bai Shuwei, Pu Yiqiao, She Kairui and Zhang Wei from DSLab successfully received their M.S. degrees in the computer field from Lanzhou University today. Over the past three years, the four students focused on research into safety systems, and their achievements were excellent. They published 20 papers in total, of which one is indexed by SCI and 9 are indexed by EI. Two of them received the IBM Outstanding Students Scholarship. As team leaders or main team members, they placed in the Top 2 of the Final Contest of the 2006 Power Designing Contest, won the Excellent Award in the 2007 Power Designing Contest, and received an accessit in the 2008 Gansu Challenge Cup.

Prof. Nicholas McGuire and Prof. Zhou Qingguo visited the Hefei GaoKe Information Technology Engineering company

Prof. Nicholas McGuire and Prof. Zhou Qingguo visited the Hefei GaoKe Information Technology Engineering company, which is committed to the development of high-reliability, high-safety computer control systems. They met Prof. Han Jianghong, Prof. Lu Yang, Prof. Wei Zhen, Prof. Tang Hao, Prof. Shi Jiugen, Dr. Wei Zhenchun, Dr. Liu Zhengyu and Dr. Ma Xuesen.




Embedded world Conference 2009 will be held on March 3 – 5, 2009, at Messezentrum Nuernberg, Germany

Embedded world Conference 2009 will be held on March 3 – 5, 2009, at Messezentrum Nuernberg, Germany.


Prof. Nicholas McGuire will give seminars at the embedded developers forum in Germany in October 2008