Presentation is loading. Please wait.

Presentation is loading. Please wait.

Page 1 TI Internal Use Only Wireless Terminals Software Devt. Page 1 OMAP3430 Linux Power Management 16 Apr 2008 Version 1.

Similar presentations


Presentation on theme: "Page 1 TI Internal Use Only Wireless Terminals Software Devt. Page 1 OMAP3430 Linux Power Management 16 Apr 2008 Version 1."— Presentation transcript:

1 Page 1 TI Internal Use Only Wireless Terminals Software Devt. Page 1 OMAP3430 Linux Power Management 16 Apr 2008 Version 1

2 Wireless Terminals Software Devt. Page 2 Too big to fully describe but lets try –Digital Television with Linux - Architecture and Opportunities from Bas Engel cancelled. –Will fill in and expand on yesterday’s demo on OMAP3 power –Give an overview of architecture which was created with some typical data flows. HW overview Software overview –Ask any questions as we go please –Might not make it through all slides but hopefully touch on all of points in data flow and any questions

3 CE Linux Forum / Embedded Linux Conference 2008 Technical Showcase What is demonstrated How was the Linux improved Patch Availability Hardware Information Power Management on OMAP3 Richard Woodruff / Texas Instruments Texas Instruments OMAP3430 ARM CortexA8 + TI IVA2.2 + IMG SGX530 + TI ISP + more… http://focus.ti.com/pdfs/wtbu/swpu114g.pdf http://www.omapzoom.org/wiki/tiki-index.php http://linux.omap.com/pub/kernel/3430sdp/ Linux running PowerTop while being measured. The system goes to 0 volts in between system ticks. Full SOC context is lost and restored as dictated by data flow. Linux CpuIdle + menu governor drive 7 idle/c-states Linux CpuFreq + ondemand governor drive 4 operating/p- states. A 5 th overdrive operating point is available Linux Dynamic tick is implemented (CONFIG_NO_HZ) Traditional suspend/resume is implemented TI provides an open source reference implementation including TRM and boards on current Linux kernels. This has been implemented when available with standard Linux components TI has added a resource framework for multi-level resource evaluation to include multiple multi-state power domain support, new constraint support, power aggressive drivers, and more…

4 Wireless Terminals Software Devt. Page 4 Main generic points Idea of partial activity. –Functional clock can run dumping out data –Other flow path parts of the chip can be disabled and not accessible while your F-Clock runs –Software does part of the work and hardware dependencies do the rest System will restore context only on the data flow/control path. There is no need to restore the whole system. For this to happen drivers need to: –Aggressively get and release resources This is NOT suspend/resume. This is on activity path. The device is _still_ is accessible. You are just enabling/disabling around access or groups of access. They must follow TRM guidelines and do get/put. Don’t leave status around. –Structured so it can handle side effects of possible hardware register loss at clock disable time. –Prepared to restore context on clock enable. –Prepared to enable proper wake up sources specific to function –If have some operational restriction need to raise this as a constraint If I’m an audio device with a fixed external clock, I know my fifo size and its drain rate. I know at the driver my acceptable latency. I need to tell the system so it doesn’t cut power for two long. Need for some kind of shared resource (vote taking) manager to take the inputs and do the right things.

5 Wireless Terminals Software Devt. Page 5 Why bother / Few Driving Considerations End goal –Must maintain uW chip consumptions in idle modes –Must increase performance but keep similar power envelope as batteries are scaling at same rate and are costly New factor –New process 65nm, 45nm, 32nm… with ~30x or more leakage increase –Higher active power consumption with fast DPLLs (100sMHz to GHz) Design needs –Must support OFF mode 0 volts (full context loss) to support uW idle use cases –Must be able to wake from idle modes with low latency. –As low as possible users space intervention. System to go to lowest power state automatically.

6 Wireless Terminals Software Devt. Page 6 What is an OMAP34xx (from public TRM)

7 Wireless Terminals Software Devt. Page 7 OMAP3430 Power Partitioning

8 Wireless Terminals Software Devt. Page 8 HW PM Features in OMAP3430 Multi Level Clock Gating Module -> Clock Domain -> DPLL HW Organized for Power Saving Voltage Domains -> Power Domains -> Clock Domains -> Clocks 16 Power domains Two independently scalable voltage domains VDD1 & VDD2 Dynamic Voltage and Frequency Scaling (DVFS) Dynamically changing OPP to suit execution requirements Standby Leakage Management (SLM) Individual power domain & memory RET & OFF support System wide RET & OFF support SmartReflex™ Dynamically adjusting VDD1 & VDD2 OPP voltages to suit the silicon characteristics, temperature, voltage. Power IC Support Dedicated HS I2C interfaces for controlling SMPS, LDO etc.

9 Wireless Terminals Software Devt. Page 9 How 3430 Linux PM SW Exploits HW Features Control of power resources –Clocks, power domains, voltage domains, memories and power IC resources Inactive state power saving in OS Idle via. CPUIdle and suspend/resume –Automatic choice between multiple idle C-states by menu governor –Manual sleep through traditional Linux suspend/resume –System wide sleep states to OFF and Retention mode Device driver interfaces to control power resources Constraints & constraints evaluation OPP Control –Operating point control though Dynamic Voltage and Frequency Scaling (DVFS) and SmartReflex™ Active state dynamic load prediction and policy management via CPUFreq –Ondemand governor to control OPP/P-States Support for DSS Low Power Refresh mode

10 Wireless Terminals Software Devt. Page 10 OMAP 3430 Linux PM Reference Arch.

11 Wireless Terminals Software Devt. Page 11 Key 3430 Linux PM SW Knobs Aggressive resource usage control MPU Retention MPU OFF CORE Retention Core Off & Off Mode Support Other Power Domain Off / Retention DVFS SmartReflex™ HW Supervised Clock domain transitions Display Low Power Refresh

12 Wireless Terminals Software Devt. Page 12 Baseport contents Functionality available –PRCM APIs –Clock framework –Shared Resource Framework –Resource tracking –SmartReflex TM driver –Power IC driver –Constraints framework –CPUIdle framework –CPUFreq framework –Display Low Power Refresh –OFF Mode Support –Suspend/ Resume Support

13 Wireless Terminals Software Devt. Page 13 PRCM API Layer 1 & 2 Abstraction of register programming sequences Layer 1 provides basic functionalities –Enabling/disabling clocks –Setting power domain states etc. –Controlling DPLLs –Configuring DPLL/clock dividers –Setting clock source Layer 2 provides advanced functionalities –Doing the frequency change part of OPP change –Doing the voltage change part of OPP change –Setting the low power mode for the chip Used by higher layers Files –2.6_kernel\arch\arm\mach-omap2\prcm_34xx.c –2.6_kernel\arch\arm\mach-omap2\prcm-regs.h –2.6_kernel\include\asm-arm\arch-omap\prcm.h

14 Wireless Terminals Software Devt. Page 14 Clock Framework Centralized control for all clock related functionalities –Status (Enable/ Disable) –Rate –Dependencies (Clock Tree) Reference counting to avoid potential sharing conflicts Re-entrant code to handle simultaneous requestors Two Layers –Generic layer common for all OMAP platforms –OMAP3430 specific clock framework layer Hierarchical –Clocks represented as hierarchical tree taking into account clock dependencies (parent, children) Standard open source solution (main lined) Used by device drivers Internally uses the PRCM API layers 1 & 2 Files –2.6_kernel\arch\arm\plat-omap\clock.c –2.6_kernel\arch\arm\mach-omap2\clock_34xx.c –2.6_kernel\arch\arm\mach-omap2\clock_34xx.h –2.6_kernel\include\asm-arm\arch-omap\clock.h

15 Wireless Terminals Software Devt. Page 15 Shared Resources Framework Centralized control for controlling shared resources such as power domains, DPLLs and memory resources Integrated with clock framework to automatically control power domains (except mpu, core and neon) based on clock requests Reference counting –Avoids potential sharing conflicts –Ensures supply of highest requested power domain/ memory state Re-entrant code –Handles multiple simultaneous requests to a resource Synchronous calls Capability to get –Current state of the resource –Notifications on resource state change Generic framework that can be extended to control any type of resource. Is currently used to model constraints Files –2.6_kernel\arch\arm\mach-omap2\resource_34xx.c –2.6_kernel\arch\arm\mach-omap2\resource_34xx.h –2.6_kernel\include\asm-arm\arch-omap\resource.h

16 Wireless Terminals Software Devt. Page 16 Resources Tracking Layer Tracking layer integrated into respective frameworks : clock framework, shared resource framework Tracks resources in use and resource users –List of resources in use (and state) –List of users (drivers) of a particular resource

17 Wireless Terminals Software Devt. Page 17 SmartReflex ™ Driver SmartReflex™ driver allows for auto-compensation of VDD1 and VDD2 voltages (around the voltages specified by current OPP) by analyzing the silicon characteristics, temperature, voltage etc The driver configures the Voltage Processor and Voltage Controller The driver interfaces to the Triton2 Power IC via the SR I2C bus Files –2.6_kernel\arch\arm\mach-omap2\smartreflex.c –2.6_kernel\arch\arm\mach-omap2\smartreflex.h SW configuration –To enable Smart Reflex echo –n 1 >sys/power/sr_vdd1_autocomp (for VDD1) echo –n 1 >sys/power/ sr_vdd2_autocomp (for VDD2) –To disable Smart Reflex echo –n 0 >sys/power/sr_vdd1_autocomp (for VDD1) echo –n 0 >sys/power/ sr_vdd2_autocomp (for VDD2)

18 Wireless Terminals Software Devt. Page 18 Power IC Driver The Power IC driver provides APIs for configuring Triton2. Drivers call twl4030 APIs to enable/disable their LDOs. Triton specific configuration (enabling smart reflex, etc) is done during bootup in twl4030_power driver. Files –2.6_kernel/drivers/i2c/chips/twl4030_power.c –2.6_kernel/include/asm-arm/arch-omap/power_companion.h

19 Wireless Terminals Software Devt. Page 19 CPUIdle Framework The following C states are supported in Cpuidle driver: –C0 – System executing code –C1 – MPU WFI + Core active + No tick suppression –C2 – MPU WFI + Core active + Tick suppression –C3 – MPU CSWR + Core active + Tick suppression –C4 – MPU OFF + Core active + Tick suppression –C5 – MPU RET + Core RET + Tick suppression –C6 – MPU OFF + Core RET + Tick suppression –C7 – MPU OFF + Core OFF + Tick suppression Menu governor adapted to support OMAP3 specific requirements. Takes the following into account to decide the target sleep state: –Next timer expiry in the system –Comparing target residency with available sleep time –Comparing exit latency with system wide latency constraints –Checking for activity in the core domain Dynamic tick based on support in kernel. Files –2.6_kernel/arch/arm/mach-omap2/pm_34xx.c –2.6_kernel/arch/arm/mach-omap2/sleep_34xx.S –2.6_kernel/arch/arm/plat-omap/timer32k.c SW configuration and information –echo –n > /sys/power/cpuidle_deepest_state Locks the deepest system state at. ex.: If state_number = 3, the system does not go to CORE RET and CORE OFF. –cat /proc/pm_prepwstst Displays the previous power state of MPU, CORE, IVA and clocks in the system.

20 Wireless Terminals Software Devt. Page 20 CPUIdle Framework (contd.) The state of power domains other than MPU and CORE –Handled outside the idle loop by the shared resource framework –Based on the requests of the clocks in the power domains NEON and PER domains are controlled inside the idle loop since their states are closely coupled with MPU and CORE states respectively. The list of c-states to be supported needs to be refined based on the power savings and latency of each state. Cpuidle framework has these components: –CPUidle governor: Entity which decides the target state C state of the system. –CPUidle driver : Entity which populates the C states supported by the system and implements functions to transition to the C states. –Generic Cpuidle framework: Every time the idle loop is called, this framework calls the current governor to decide the target C state of the system. It then calls the current driver to transition to the C state selected by the governor.

21 Wireless Terminals Software Devt. Page 21 Suspend/ Resume Every driver implements suspend/resume after registration with LDM. Drivers release clocks and save context in suspend call and restore these when resume is called. Drivers which have already released their clocks and have saved their context need not do anything in their suspend call. Files –2.6_kernel/arch/arm/mach-omap2/pm_34xx.c –2.6_kernel/arch/arm/mach-omap2/prcm_34xx.c –2.6_kernel/arch/arm/mach-omap2/sleep_34xx.s

22 Wireless Terminals Software Devt. Page 22 OFF Mode Support All registers in the power domain are reset when the power domain goes to OFF. OFF mode could introduce considerable latency for wakeup. The system can enter chip off through two paths: –Idle loop –Suspend/Resume Driver adaptations for OFF mode support are discussed in later slides

23 Wireless Terminals Software Devt. Page 23 Constraints Framework Constraints are stored in the shared resource framework. Type of constraint can be obtained by specifying the name of the constraint. There are three types of constraints that are implemented: –Interrupt latency constraints Constraints to control system wide acceptable latency (the target C state that is programmed in idle thread) Internally uses the latency constraint APIs available in the kernel –Power Domain latency constraints Constraints to control transition of a particular power domain –Frequency/OPP constraints Constraints to specify a Frequency/OPP at which the driver can operate Files –2.6_kernel\arch\arm\mach-omap2\resource_34xx.c –2.6_kernel\arch\arm\mach-omap2\resource_34xx.h –2.6_kernel\include\asm-arm\arch-omap\resource.h

24 Wireless Terminals Software Devt. Page 24 DVFS DVFS is implemented using the constraints framework Drivers/Bridge express constraints on OPP or/and on ARM as well as DSP frequency using the constraints framework The constraints framework evaluates all the constraints available before setting the target OPP/Frequency If any pre/post notification functions are registered, the constraint framework takes care of calling them before and after the OPP/Frequency change Along with exported DVFS APIs for VDD2 OPP change, the high VDD2 OPP are pegged to high VDD1 OPPs (since a high CPU frequency often indicates high L3 bandwidth usage) SW configuration –To change the OPP of VDD1, use echo – n > /sys/power/ vdd1_opp_value. opp_number can be 1,2,3,4 or 5. Default is 3 (when cpufreq is disabled) –To change the OPP of VDD2, use echo – n > /sys/power/ vdd2_opp_value opp_number can be 2 or 3. Default is 3 (when cpufreq is disabled)

25 Wireless Terminals Software Devt. Page 25 CPUFreq Framework Consists of three major blocks –CPUFreq core –CPUFreq Driver –CPUFreq governor TI uses the kernel space governor ondemand CPUFreq Core –The CPU Freq Driver registers with the Core, and populates a structure CPUFreq_driver, which is populated with the API’s implemented by the driver CPUFreq Driver –Interfaces with shared resource framework to implement an OPP change CPUFreq governor –The ondemand governor sets the CPU speed depending on the current usage

26 Wireless Terminals Software Devt. Page 26 Display Low Power Refresh LPR – OSIdle with LCD ON. Display in VGA mode, 60fps DSS logic and pixel clocks are reduced Available DSS FIFOs are merged for greater FIFO depth CORE is configured to transition to RET when DSS is inactive SW configuration –To enable Low Power Refresh mode echo –n “enable” > sys/class/display_control/omap_disp_control/lpr –To disable Low Power Refresh mode echo –n “disable” > sys/class/display_control/omap_disp_control/lpr

27 Wireless Terminals Software Devt. Page 27 Device Driver PM support

28 Wireless Terminals Software Devt. Page 28 Device Driver Power Responsibilities Device drivers need to aggressively manage request/release of clocks through clock framework Device drivers should configure optimal power settings Device drivers need to register with the LDM and implement suspend() and resume() calls. Device drivers can specify constraints when required Device drivers need to implement context save/restore and interact with the appropriate framework for the same.

29 Wireless Terminals Software Devt. Page 29 Aggressive Clock Management Device drivers are expected to control their clocks on a request basis so that we will be able to hit low power states in idle loop whenever possible. If the drivers are not transaction based, there can be an inactivity timer for the driver to cut off clocks after a period of inactivity. Examples of changes being made to drivers: –Drivers which are transaction based can control clocks based on activity. ex.: i2c driver enables clocks when there are pending requests and disables them when there are no pending requests. –Camera – Clocks will be enabled as long as the driver is required (i.e. camera is running). –Display – The fbdev inactivity timer (which is tied to user activity) can be used to turn off display clock –MMC – Clocks are controlled on a per command basis. –GPTimer – Clocks are controlled as per requirement, i.e. when a timer is in use, they will be enabled and will be disabled when they are not in use –UART – Console clocks are cut in the idle loop (before putting core domain to retention) and other UART clocks could be controlled on a need basis. –USB – Clocks can be controlled as per requirement (only when transfers are going on).

30 Wireless Terminals Software Devt. Page 30 Device Driver Specific Power Settings After enabling clocks, the module must configure sysconfig register with the following settings: –AutoIdle –SmartIdle –SmartStandby –Clock activity – set it to FCLK ON ICLK OFF when clocks are enabled – change it to FCLK OFF ICLK OFF just before clocks are disabled –WKEN bit In addition any other additional wakeup registers/ power registers of the module need to be configured –usually mentioned in the power management section of the module in the TRM. The clocks domains are always configured in hardware supervised mode so that interface clock is automatically controlled by the hardware.

31 Wireless Terminals Software Devt. Page 31 Impact due to DVFS Validation requirement for DVFS –Determine lowest OPP at which a module can function –Validate that DVFS can happen when a functionality is on- going If a device driver requires a minimum CPU/L3 frequency for its operations, it can set a constraint on OPP or on MPU/IVA frequency using the constraints framework. A device driver can register for pre/post notifications on VDD1 or VDD2 DVFS change.

32 Wireless Terminals Software Devt. Page 32 OFF mode impact The following questions need to be addressed for OFF mode through idle loop: –When should the drivers save and restore context? –What is the context to be saved and restored? –During driver operation, when can a power domain go to off mode? –How will the power management framework decide when to put a power domain to OFF?

33 Wireless Terminals Software Devt. Page 33 Context Save & Restore When all drivers in a power domain release their clocks, the power domain can go to RET or OFF state. –The shared resource framework programs the power domain to target state depending on the latency constraints in the system. Drivers can follow any of the following methods to save and restore their context: –Always save/ always restore – drivers which do not have lots of registers can always save context and restore context because it will not cause a lot of overhead. –Early save/ restore on demand – In this method, drivers save context every time they release their clocks but restore it only if the power domain has actually gone to off after saving – this makes sense for drivers which have a large restore time with save time being minimal

34 Wireless Terminals Software Devt. Page 34 How to Prevent Domain Power OFF? Drivers prevent OFF mode by specifying latency constraints. –Power domain latency constraint can be used to control the transitions (OFF/RET) of a particular power domain In some cases drivers do not know when to express a power domain latency constraint, either because it is not aware of the current system level use case or because it does not know when its API will be called next by an upper layer stack. For such cases, an inactivity timer is provided (for CORE and PER domains) that will delay the transition of a power domain to OFF –Interrupt latency constraint will be used to prevent a system wide low power state Drivers can also prevent OFF mode by keeping their clocks active (in which case OFF is not attempted).

35 Wireless Terminals Software Devt. Page 35 Debugging To check the count of MPU/CORE low power transitions in CPUIdle –cat /proc/pm_prepwst To check if voltage transitions are happening in chip RET/OFF cases –Confirm that LED D5 on OMAP3430 SDP is blinking To confirm frequency change during DVFS –Observe "BogoMIPS" by executing cat /proc/cpuinfo To check clocks active in the system –cat /proc/omap_clocks To modify the FB inactivity timeout value –echo –n > /sys/power/ fb_timeout_value To disable framebuffer inactivity timer, do: –echo –n 0 > /sys/power/ fb_timeout_value To disable hw supervised clock domain transitions in menuconfig –Power Management Options -> Omap power management options -> OMAP3430 enable H/W supervised transition for clock domains

36 Wireless Terminals Software Devt. Page 36 Debugging (contd.) To disable MPU/CORE OFF transitions in menuconfig –To disable CORE OFF Go to Power Management Options -> OMAP power management options Unselect the option “Enable CORE OFF in cpuidle” –To disable MPU OFF: Go to Power Management Options -> OMAP power management options Unselect the option “Enable MPU OFF in suspend/resume and cpuidle” To enable CPUFreq with ondemand governor –In menuconfig change the following Enable CPU Frequency scaling -> CPU Frequency scaling Enable CPU Frequency scaling -> CPU Frequency scaling -> ‘ondemand’ cpufreq policy governor Change Power Management options -> OMAP Power management options ->Default VDD1 OPP Level to MPU-125/DSP-90Mhz Change Power Management options -> OMAP Power management options ->Default VDD2 OPP level to L3 83 MHz –After bootup execute the following command, this line can also be added in /etc/init.d/rcS file echo "ondemand" > /sys/devices/system/cpu/cpu0/cpufreq/scaling_governor


Download ppt "Page 1 TI Internal Use Only Wireless Terminals Software Devt. Page 1 OMAP3430 Linux Power Management 16 Apr 2008 Version 1."

Similar presentations


Ads by Google