L1 Cache and TLB Enhancements to the RAMpage Memory Hierarch(13)

时间:2026-04-30   来源:未知    
字号:

Abstract. The RAMpage hierarchy moves main memory up a level to replace the lowest-level cache by an equivalent-sized SRAM main memory, with a TLB caching page translations for that main memory. This paper illustrates how more aggressive components higher

contextswitchesonmisses,ofthevariationspresentedhere,isabletohidetheincreasinge ectivelatencyofDRAM.IncreasingthesizeoftheTLB,aspredicted,increasedtherangeofSRAMmainmemorypagesizesoverwhichRAMpageisviable,wideningtherangeofchoicesforadesigner.

5Conclusion

ThispaperhasexaminedenhancementstoRAMpage,whichmeasureitspoten-tialforfurtherimprovement,asopposedtosimilarimprovementstoaconven-tionalhierarchy.Asinpreviouswork,RAMpagehasbeenshowntoscalebetterastheCPU-DRAMspeedgapgrows.Inaddition,ithasbeenshownthatcon-textswitchesonmissescantakeadvantageofamoreaggressivecoreincludingabiggerL1cache,andabiggerTLB.Theremainderofthissectionsummarizesresults,outlinesfutureworkandsumsupoverall ndings.

5.1SummaryofResults

Introducingsigni cantlylargerL1caches–evenifthiscouldbedonewith-outproblemswithmeetingclockcycletargets–haslimitedbene ts.Scalingtheclockspeedupbyafactorof8achievesonlyabout77%ofthisspeedupinacon-ventionalhierarchymeasuredhere.RAMpagewithcontextswitchesonmissesisabletomakee ectiveuseofalargerL1cache,andachievessuperlinearspeedupwithrespecttoaslowerclockspeedandsmallerL1cache.Whilethise ectcanonlybeexpectedinRAMpagewithanunrealisticallylargeL1,thisresultshowsthatincreasinglyaggressiveL1cachesarenotasimportantasolutiontothememorywallproblemas ndingalternativeworkonamisstoDRAM.

ThatresultsforRAMpagewithoutcontextswitchesonmissesareanim-provementbutnotassigni cantasresultswithcontextswitchesonmissessug-geststhatattemptsatimprovingassociativityandreplacementstrategywillnotbesu cienttobridgethegrowingCPU-DRAMspeedgap.

LargerTLBs,asexpected,increasetherangeofusefulRAMpageSRAMmainmemorypagesizes,thoughtheperformancebene tontheworkloadmeasuredwasnotsigni cantversuslargerpagesizesandamoremodest-sizedTLB.

5.2FutureWork

ItwouldbeinterestingtomatchRAMpagewithmodelsforsupportingmorethanoneinstructionstream.SMT,whileaddinghardwarecomplexity,isanestablishedapproach[19],withexistingimplementations[3].Anotherthingtoexploreisalternativeinterconnectarchitectures,somultiplerequestsforDRAMcouldbeoverlapped[24].HyperTransport[2]isacandidate.Amoredetailedsimulationmodellingoperatingsysteme ectsaccuratelywouldbeuseful.SimOS

[26],forexample,couldbeused.Furthervariationstoexploreincludevirtually-addressedL1andhardwareTLBmisshandling.Finally,itwouldbeinterestingtobuildaRAMpagemachine.

L1 Cache and TLB Enhancements to the RAMpage Memory Hierarch(13).doc 将本文的Word文档下载到电脑,方便复制、编辑、收藏和打印
× 游客快捷下载通道(下载后可以自由复制和排版)
VIP包月下载
特价:19 元/月 原价:99元
低至 0.1 元/份 每月下载300
全站内容免费自由复制
VIP包月下载
特价:19 元/月 原价:99元
低至 0.1 元/份 每月下载300
全站内容免费自由复制
注:下载文档有可能出现无法下载或内容有问题,请联系客服协助您处理。
× 常见问题(客服时间:周一到周五 9:30-18:00)