Presentation is loading. Please wait.

Presentation is loading. Please wait.

Performance Model for Future Multicore Process Designs Yipkei Kwok 02/06/2008.

Similar presentations


Presentation on theme: "Performance Model for Future Multicore Process Designs Yipkei Kwok 02/06/2008."— Presentation transcript:

1 Performance Model for Future Multicore Process Designs Yipkei Kwok 02/06/2008

2 A Non-Work-Conserving Operating System Scheduler For SMT Processors Authors: A. Fedorova et. al Calculate optimal level of //ism of SMT Processors at run time Analytical model Estimate the workload’s IPC for a given degree of concurrency 1 st id’fy performance bottle Suppressing L2 misses improves performance the best

3 A Non-Work-Conserving Operating System Scheduler For SMT Processors Factors –N –perf_cache_CPI(N) –L2_RMR –L2_WMR –L2_WBR_R –L2_WBR_W –WSC –L2_MCOST

4 Non-Work-Conserving Operating System Scheduler For SMT Processors 2-phases scheduling –Preparation phase Collect model inputs under full //ism W./ hardware counters Till the retirement of the 100 million-th instructions –Optimization phase Estimate optimal N Enforce it Till … … –New locality phase

5 Limitations 3-56% improvement but ….. Empirical model based on UltraSparc T1 SMT only –But expandable w./, hopefully, reasonable effort Once expanded, performance prediction What’re needed? –Extra factors?

6 What new factors? Depends on systems to model Shared-memory machine Threaded // workloads SMP of CMPs SMT per core

7 What new factors? Architecture –Homo/hetero cores Difference in speed, or functionality –Level of cache sharing –Interconnects

8 What new factors? Params –#(cores) –Cache size –Degree of set-associativity –#(cores) sharing a cache –Bus, ring, crossbar, tiny-network –Switching & flow mechanisms –Routing algos –Fault tolerance techniques

9 What new factors? Protocols –Cache coherence protocol at dedicated/semi- shared cache Algorithms –Block replacement algorithm –Algorithms of cache coherence and data consistency protocols

10 Potential uses Performance prediction for future processors Scheduler

11 Similar work exists? Multi2Sim (2007) –Framework simulating the system working as a whole –Yet, app-only simulation –Evaluate multicore-multithreaded processors –3 major components simulated Core Cache hierarchy Interconnect –Note: source code available

12 Enough? Limitations –Homogenous core –Topology Bus only W./ variable bus width though


Download ppt "Performance Model for Future Multicore Process Designs Yipkei Kwok 02/06/2008."

Similar presentations


Ads by Google