CPUID

In the x86 architecture, the CPUID instruction (identified by a CPUID opcode) is a processor supplementary instruction (its name derived from CPU Identification) allowing software to discover details of the processor. It was introduced by Intel in 1993 with the launch of the Pentium and SL-enhanced 486 processors.[1]

A program can use the CPUID to determine processor type and whether features such as MMX/SSE are implemented.

History

Prior to the general availability of the CPUID instruction, programmers would write esoteric machine code which exploited minor differences in CPU behavior in order to determine the processor make and model.[2][3] With the introduction of the 80386 processor, EDX on reset indicated the revision but this was only readable after reset and there was no standard way for applications to read the value.

Outside the x86 family, developers are mostly still required to use esoteric processes (involving instruction timing or CPU fault triggers) to determine the variations in CPU design that are present.

In the Motorola 680x0 family — that never had a CPUID instruction of any kind — certain specific instructions required elevated privileges. These could be used to tell various CPU family members apart. In the Motorola 68010 the instruction MOVE from SR became privileged. This notable instruction (and state machine) change allowed the 68010 to meet the Popek and Goldberg virtualization requirements. Because the 68000 offered an unprivileged MOVE from SR the 2 different CPUs could be told apart by a CPU error condition being triggered.

While the CPUID instruction is specific to the x86 architecture, other architectures (like ARM) often provide on-chip registers which can be read in prescribed ways to obtain the same sorts of information provided by the x86 CPUID instruction.

Calling CPUID

The CPUID opcode is 0F A2.

In assembly language, the CPUID instruction takes no parameters as CPUID implicitly uses the EAX register to determine the main category of information returned. In Intel's more recent terminology, this is called the CPUID leaf. CPUID should be called with EAX = 0 first, as this will store in the EAX register the highest EAX calling parameter (leaf) that the CPU implements.

To obtain extended function information CPUID should be called with the most significant bit of EAX set. To determine the highest extended function calling parameter, call CPUID with EAX = 80000000h.

CPUID leaves greater than 3 but less than 80000000 are accessible only when the model-specific registers have IA32_MISC_ENABLE.BOOT_NT4 [bit 22] = 0 (which is so by default). As the name suggests, Windows NT 4.0 until SP6 did not boot properly unless this bit was set,[4] but later versions of Windows do not need it, so basic leaves greater than 4 can be assumed visible on current Windows systems. As of July 2014, basic valid leaves go up to 14h, but the information returned by some leaves are not disclosed in the publicly available documentation, i.e. they are "reserved".

Some of the more recently added leaves also have sub-leaves, which are selected via the ECX register before calling CPUID.

EAX=0: Highest Function Parameter and Manufacturer ID

This returns the CPU's manufacturer ID string  a twelve-character ASCII string stored in EBX, EDX, ECX (in that order). The highest basic calling parameter (the largest value that EAX can be set to before calling CPUID) is returned in EAX.

Here is a list of processors and the highest function implemented.

Highest Function Parameter
ProcessorsBasicExtended
Earlier Intel 486CPUID Not Implemented
Later Intel 486 and Pentium0x01Not Implemented
Pentium Pro, Pentium II and Celeron0x02Not Implemented
Pentium III0x03Not Implemented
Pentium 40x020x8000 0004
Xeon0x020x8000 0004
Pentium M0x020x8000 0004
Pentium 4 with Hyper-Threading0x050x8000 0008
Pentium D (8xx)0x050x8000 0008
Pentium D (9xx)0x060x8000 0008
Core Duo0x0A0x8000 0008
Core 2 Duo0x0A0x8000 0008
Xeon 3000, 5100, 5200, 5300, 5400 (5000 series)0x0A0x8000 0008
Core 2 Duo 8000 series0x0D0x8000 0008
Xeon 5200, 5400 series0x0A0x8000 0008
Atom0x0A0x8000 0008
Nehalem-based processors0x0B0x8000 0008
Ivy Bridge-based processors 0x0D 0x8000 0008
Skylake-based processors (proc base & max freq; Bus ref. freq) 0x16 0x8000 0008
System-On-Chip Vendor Attribute Enumeration Main Leaf 0x17 0x8000 0008

The following are known processor manufacturer ID strings:

The following are ID strings used by open source soft CPU cores:

  • "MiSTer AO486"  ao486 CPU[6]
  • "GenuineIntel"  v586 core[7] (this is identical to the Intel ID string)

The following are known ID strings from virtual machines:

For instance, on a GenuineIntel processor values returned in EBX is 0x756e6547, EDX is 0x49656e69 and ECX is 0x6c65746e. The following example code displays the vendor ID string as well as the highest calling parameter that the CPU implements.

	.intel_syntax noprefix
	.text
.m0: .string "CPUID: %x\n"
.m1: .string "Largest basic function number implemented: %i\n"
.m2: .string "Vendor ID: %s\n"

    .globl main

main:
	push    r12
	mov	    eax, 1
	sub	    rsp, 16
    cpuid
    lea	    rdi, .m0[rip]
	mov	    esi, eax
	call	printf
	mov     eax, 0
    cpuid
	lea	    rdi, .m1[rip]
	mov	    esi, eax
	mov	    r12d, edx
	mov	    ebp, ecx
	call    printf
	mov     3[rsp], ebx
	lea	    rsi, 3[rsp]
    lea	    rdi, .m2[rip]
    mov     7[rsp], r12d
    mov     11[rsp], ebp
	call	printf
	add	    rsp, 16
	pop	    r12
	ret

    .section .note.GNU-stack,"",@progbits

EAX=1: Processor Info and Feature Bits

This returns the CPU's stepping, model, and family information in register EAX (also called the signature of a CPU), feature flags in registers EDX and ECX, and additional feature info in register EBX.[10]

Processor Version Information
EAX
31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0
Reserved Extended Family ID Extended Model ID Reserved Processor Type Family ID Model Stepping ID
  • Stepping ID is a product revision number assigned due to fixed errata or other changes.
  • The actual processor model is derived from the Model, Extended Model ID and Family ID fields. If the Family ID field is either 6 or 15, the model is equal to the sum of the Extended Model ID field shifted left by 4 bits and the Model field. Otherwise, the model is equal to the value of the Model field.
  • The actual processor family is derived from the Family ID and Extended Family ID fields. If the Family ID field is equal to 15, the family is equal to the sum of the Extended Family ID and the Family ID fields. Otherwise, the family is equal to the value of the Family ID field.
  • The meaning of the Processor Type field is given in the table below.
Processor Type
Type Encoding in Binary
Original equipment manufacturer (OEM) Processor 00
Intel Overdrive Processor 01
Dual processor (not applicable to Intel486 processors) 10
Reserved value 11
Additional Information
Bits EBX Valid
7:0 Brand Index
15:8 CLFLUSH line size (Value * 8 = cache line size in bytes) if CLFLUSH feature flag is set.

CPUID.01.EDX.CLFSH [bit 19]= 1

23:16 Maximum number of addressable IDs for logical processors in this physical package;

The nearest power-of-2 integer that is not smaller than this value is the number of unique initial APIC IDs reserved for addressing different logical processors in a physical package.

Former use: Number of logical processors per physical processor; two for the Pentium 4 processor with Hyper-Threading Technology.[11]

if Hyper-threading feature flag is set.

CPUID.01.EDX.HTT [bit 28]= 1

31:24 Local APIC ID: The initial APIC-ID is used to identify the executing logical processor.

It can also be identified via the cpuid 0BH leaf ( CPUID.0Bh.EDX[x2APIC-ID] ).

Pentium 4 and subsequent processors.

The processor info and feature flags are manufacturer specific but usually, the Intel values are used by other manufacturers for the sake of compatibility.

Feature Information
BitEDXECX
ShortFeatureShortFeature
0 fpuOnboard x87 FPUsse3Prescott New Instructions-SSE3 (PNI)
1 vmeVirtual 8086 mode extensions (such as VIF, VIP, PIV)pclmulqdqPCLMULQDQ
2 deDebugging extensions (CR4 bit 3)dtes6464-bit debug store (edx bit 21)
3 psePage Size ExtensionmonitorMONITOR and MWAIT instructions (SSE3)
4 tscTime Stamp Counterds-cplCPL qualified debug store
5 msrModel-specific registersvmxVirtual Machine eXtensions
6 paePhysical Address ExtensionsmxSafer Mode Extensions (LaGrande)
7 mceMachine Check ExceptionestEnhanced SpeedStep
8 cx8CMPXCHG8 (compare-and-swap) instructiontm2Thermal Monitor 2
9 apicOnboard Advanced Programmable Interrupt Controllerssse3Supplemental SSE3 instructions
10 (reserved)cnxt-idL1 Context ID
11 sepSYSENTER and SYSEXIT instructionssdbgSilicon Debug interface
12 mtrrMemory Type Range RegistersfmaFused multiply-add (FMA3)
13 pgePage Global Enable bit in CR4cx16CMPXCHG16B instruction
14 mcaMachine check architecturextprCan disable sending task priority messages
15 cmovConditional move and FCMOV instructionspdcmPerfmon & debug capability
16 patPage Attribute Table(reserved)
17 pse-3636-bit page size extensionpcidProcess context identifiers (CR4 bit 17)
18 psnProcessor Serial NumberdcaDirect cache access for DMA writes[12][13]
19 clfshCLFLUSH instruction (SSE2)sse4.1SSE4.1 instructions
20 (reserved)sse4.2SSE4.2 instructions
21 dsDebug store: save trace of executed jumpsx2apicx2APIC
22 acpiOnboard thermal control MSRs for ACPImovbeMOVBE instruction (big-endian)
23 mmxMMX instructionspopcntPOPCNT instruction
24 fxsrFXSAVE, FXRESTOR instructions, CR4 bit 9tsc-deadlineAPIC implements one-shot operation using a TSC deadline value
25 sseSSE instructions (a.k.a. Katmai New Instructions)aesAES instruction set
26 sse2SSE2 instructionsxsaveXSAVE, XRESTOR, XSETBV, XGETBV
27 ssCPU cache implements self-snooposxsaveXSAVE enabled by OS
28 httHyper-threadingavxAdvanced Vector Extensions
29 tmThermal monitor automatically limits temperaturef16cF16C (half-precision) FP feature
30 ia64IA64 processor emulating x86rdrndRDRAND (on-chip random number generator) feature
31 pbePending Break Enable (PBE# pin) wakeup capabilityhypervisorHypervisor present (always zero on physical CPUs)[14][15]

Reserved fields should be masked before using them for processor identification purposes.

EAX=2: Cache and TLB Descriptor information

This returns a list of descriptors indicating cache and TLB capabilities in EAX, EBX, ECX and EDX registers.

EAX=3: Processor Serial Number

This returns the processor's serial number. The processor serial number was introduced on Intel Pentium III, but due to privacy concerns, this feature is no longer implemented on later models (the PSN feature bit is always cleared). Transmeta's Efficeon and Crusoe processors also provide this feature. AMD CPUs however, do not implement this feature in any CPU models.

For Intel Pentium III CPUs, the serial number is returned in the EDX:ECX registers. For Transmeta Efficeon CPUs, it is returned in the EBX:EAX registers. And for Transmeta Crusoe CPUs, it is returned in the EBX register only.

Note that the processor serial number feature must be enabled in the BIOS setting in order to function.

EAX=4 and EAX=Bh: Intel thread/core and cache topology

These two leaves are used for processor topology (thread, core, package) and cache hierarchy enumeration in Intel multi-core (and hyperthreaded) processors.[16] As of 2013 AMD does not use these leaves but has alternate ways of doing the core enumeration.[17]

Unlike most other CPUID leaves, leaf Bh will return different values in EDX depending on which logical processor the CPUID instruction runs; the value returned in EDX is actually the x2APIC id of the logical processor. The x2APIC id space is not continuously mapped to logical processors, however; there can be gaps in the mapping, meaning that some intermediate x2APIC ids don't necessarily correspond to any logical processor. Additional information for mapping the x2APIC ids to cores is provided in the other registers. Although the leaf Bh has sub-leaves (selected by ECX as described further below), the value returned in EDX is only affected by the logical processor on which the instruction is running but not by the subleaf.

The processor(s) topology exposed by leaf Bh is a hierarchical one, but with the strange caveat that the order of (logical) levels in this hierarchy doesn't necessarily correspond to the order in the physical hierarchy (SMT/core/package). However, every logical level can be queried as an ECX subleaf (of the Bh leaf) for its correspondence to a "level type", which can be either SMT, core, or "invalid". The level id space starts at 0 and is continuous, meaning that if a level id is invalid, all higher level ids will also be invalid. The level type is returned in bits 15:08 of ECX, while the number of logical processors at the level queried is returned in EBX. Finally, the connection between these levels and x2APIC ids is returned in EAX[4:0] as the number of bits that the x2APIC id must be shifted in order to obtain a unique id at the next level.

As an example, a dual-core Westmere processor capable of hyperthreading (thus having two cores and four threads in total) could have x2APIC ids 0, 1, 4 and 5 for its four logical processors. Leaf Bh (=EAX), subleaf 0 (=ECX) of CPUID could for instance return 100h in ECX, meaning that level 0 describes the SMT (hyperthreading) layer, and return 2 in EBX because there are two logical processors (SMT units) per physical core. The value returned in EAX for this 0-subleaf should be 1 in this case, because shifting the aforementioned x2APIC ids to the right by one bit gives a unique core number (at the next level of the level id hierarchy) and erases the SMT id bit inside each core. A simpler way to interpret this information is that the last bit (bit number 0) of the x2APIC id identifies the SMT/hyperthreading unit inside each core in our example. Advancing to subleaf 1 (by making another call to CPUID with EAX=Bh and ECX=1) could for instance return 201h in ECX, meaning that this is a core-type level, and 4 in EBX because there are 4 logical processors in the package; EAX returned could be any value greater than 3, because it so happens that bit number 2 is used to identify the core in the x2APIC id. Note that bit number 1 of the x2APIC id is not used in this example. However, EAX returned at this level could well be 4 (and it happens to be so on a Clarkdale Core i3 5x0) because that also gives a unique id at the package level (=0 obviously) when shifting the x2APIC id by 4 bits. Finally, you may wonder what the EAX=4 leaf can tell us that we didn't find out already. In EAX[31:26] it returns the APIC mask bits reserved for a package; that would be 111b in our example because bits 0 to 2 are used for identifying logical processors inside this package, but bit 1 is also reserved although not used as part of the logical processor identification scheme. In other words, APIC ids 0 to 7 are reserved for the package, even though half of these values don't map to a logical processor.

The cache hierarchy of the processor is explored by looking at the sub-leaves of leaf 4. The APIC ids are also used in this hierarchy to convey information about how the different levels of cache are shared by the SMT units and cores. To continue our example, the L2 cache, which is shared by SMT units of the same core but not between physical cores on the Westmere is indicated by EAX[26:14] being set to 1, while the information that the L3 cache is shared by the whole package is indicated by setting those bits to (at least) 111b. The cache details, including cache type, size, and associativity are communicated via the other registers on leaf 4.

Beware that older versions of the Intel app note 485 contain some misleading information, particularly with respect to identifying and counting cores in a multi-core processor;[18] errors from misinterpreting this information have even been incorporated in the Microsoft sample code for using CPUID, even for the 2013 edition of Visual Studio,[19] and also in the sandpile.org page for CPUID,[20] but the Intel code sample for identifying processor topology[16] has the correct interpretation, and the current Intel Software Developer’s Manual has a more clear language. The (open source) cross-platform production code[21] from Wildfire Games also implements the correct interpretation of the Intel documentation.

Topology detection examples involving older (pre-2010) Intel processors that lack x2APIC (thus don't implement the EAX=Bh leaf) are given in a 2010 Intel presentation.[22] Beware that using that older detection method on 2010 and newer Intel processors may overestimate the number of cores and logical processors because the old detection method assumes there are no gaps in the APIC id space, and this assumption is violated by some newer processors (starting with the Core i3 5x0 series), but these newer processors also come with an x2APIC, so their topology can be correctly determined using the EAX=Bh leaf method.

EAX=6: Thermal and power management

This returns information in EAX, EBX, ECX registers.

EAX=6 CPUID bits
Bit(s) EAX EBX ECX
Feature Value Feature
0 Digital Thermal Sensor (DTS) capability Number of Interrupt Thresholds Hardware Coordination Feedback capability
1 Intel Turbo Boost Technology capability ACNT2 Capability
2 Always Running APIC Timer (ARAT) capability (reserved)
3 (reserved) Performance-Energy Bias capability
4 Power Limit Notification (PLN) capability (reserved) (reserved)
5 Extended Clock Modulation Duty (ECMD) capability
6 Package Thermal Management (PTM) capability
31:7 (reserved)

EAX=7, ECX=0: Extended Features

This returns extended feature flags in EBX, ECX, and EDX. Returns the maximum ECX value for EAX=7 in EAX.

EAX=7,ECX=0 CPUID feature bits
Bit EBX ECX EDX
Short Feature Short Feature Short Feature
0 fsgsbaseAccess to base of %fs and %gs prefetchwt1PREFETCHWT1 instruction sgx-keysAttestation Services for Intel SGX
1 IA32_TSC_ADJUST MSR avx512-vbmiAVX-512 Vector Bit Manipulation Instructions (reserved)
2 sgxSoftware Guard Extensions umipUser-mode Instruction Prevention avx512-4vnniwAVX-512 4-register Neural Network Instructions
3 bmi1Bit Manipulation Instruction Set 1 pkuMemory Protection Keys for User-mode pages avx512-4fmapsAVX-512 4-register Multiply Accumulation Single precision
4 hleTSX Hardware Lock Elision ospkePKU enabled by OS fsrmFast Short REP MOVSB
5 avx2Advanced Vector Extensions 2 waitpkgTimed pause and user-level monitor/wait uintrUser Inter-processor Interrupts
6 FDP_EXCPTN_ONLY avx512-vbmi2AVX-512 Vector Bit Manipulation Instructions 2 (reserved)
7 smepSupervisor Mode Execution Prevention cet_ssControl flow enforcement (CET) shadow stack (reserved)
8 bmi2Bit Manipulation Instruction Set 2 gfniGalois Field instructions avx512-vp2intersectAVX-512 VP2INTERSECT Doubleword and Quadword Instructions
9 ermsEnhanced REP MOVSB/STOSB vaesVector AES instruction set (VEX-256/EVEX) srdbs-ctrlSpecial Register Buffer Data Sampling Mitigations
10 invpcidINVPCID instruction vpclmulqdqCLMUL instruction set (VEX-256/EVEX) mc-clearVERW instruction clears CPU buffers
11 rtmTSX Restricted Transactional Memory avx512-vnniAVX-512 Vector Neural Network Instructions rtm-always-abortAll TSX transactions are aborted
12 rdt-m/pqmIntel Resource Director (RDT) Monitoring or AMD Platform QOS Monitoring avx512-bitalgAVX-512 BITALG instructions (reserved)
13 FPU CS and FPU DS deprecated tmeIA32_TME related MSRs TSX_FORCE_ABORT MSR is available
14 mpxIntel MPX (Memory Protection Extensions) avx512-vpopcntdqAVX-512 Vector Population Count Double and Quad-word serializeSERIALIZE instruction
15 rdt-a/pqeIntel Resource Director (RDT) Allocation or AMD Platform QOS Enforcement (reserved) hybridMixture of CPU types in processor topology (eg. Alder Lake)
16 avx512-fAVX-512 Foundation la575-level paging (57 address bits) tsxldtrkTSXLDTRK instruction
17 avx512-dqAVX-512 Doubleword and Quadword Instructions mawauThe value of userspace MPX Address-Width Adjust used by the BNDLDX and BNDSTX Intel MPX instructions in 64-bit mode (reserved)
18 rdseedRDSEED instruction pconfigPlatform configuration (Memory Encryption Technologies Instructions)
19 adxIntel ADX (Multi-Precision Add-Carry Instruction Extensions) lbrArchitectural Last Branch Records
20 smapSupervisor Mode Access Prevention cet-ibtControl flow enforcement (CET) indirect branch tracking
21 avx512-ifmaAVX-512 Integer Fused Multiply-Add Instructions (reserved)
22 (reserved) rdpidRead Processor ID and IA32_TSC_AUX amx-bf16Tile computation on bfloat16 numbers
23 clflushoptCLFLUSHOPT instruction klKey Locker avx512-fp16AVX512-FP16 half-precision floating-point instructions[23]
24 clwbCLWB instruction BUS_LOCK_DETECT amx-tileTile architecture
25 ptIntel Processor Trace cldemoteCache line demote amx-int8Tile computation on 8-bit integers
26 avx512-pfAVX-512 Prefetch Instructions (reserved) IBRS_IBPB / spec_ctrlSpeculation Control, part of Indirect Branch Control (IBC):
Indirect Branch Restricted Speculation (IBRS) and
Indirect Branch Prediction Barrier (IBPB)[24][25]
27 avx512-erAVX-512 Exponential and Reciprocal Instructions movdiriMOVDIRI instruction stibpSingle Thread Indirect Branch Predictor, part of IBC[24]
28 avx512-cdAVX-512 Conflict Detection Instructions movdir64bMOVDIR64B L1D_FLUSHIA32_FLUSH_CMD MSR
29 shaSHA extensions enqcmdEnqueue Stores IA32_ARCH_CAPABILITIES (lists speculative side channel mitigations[24])
30 avx512-bwAVX-512 Byte and Word Instructions sgx-lcSGX Launch Configuration IA32_CORE_CAPABILITIES MSR (lists model-specific core capabilities)
31 avx512-vlAVX-512 Vector Length Extensions pksProtection keys for supervisor-mode pages ssbdSpeculative Store Bypass Disable,[24] as mitigation for Speculative Store Bypass (IA32_SPEC_CTRL)

EAX=7, ECX=1: Extended Features

This returns extended feature flags in EAX, EBX, and EDX. ECX is reserved.

EAX=7,ECX=1 CPUID feature bits
Bit EAX EBX EDX
ShortFeature ShortFeature ShortFeature
0 (reserved) IA32_PPIN and IA32_PPIN_CTL MSRs (reserved)
1 (reserved) (reserved) (reserved)
2 (reserved) (reserved) (reserved)
3 rao-intRAO-INT instructions (reserved) (reserved)
4 avx-vnniAVX Vector Neural Network Instructions (VNNI) (VEX encoded) (reserved) avx-vnn-int8AVX VNNI INT8 instructions
5 avx512-bf16AVX-512 instructions for bfloat16 numbers (reserved) avx-ne-convertAVX NE CONVERT instructions
6 lassLinear Address Space Separation (reserved) (reserved)
7 cmpccxaddCMPccXADD instructions (reserved) (reserved)
8 archperfmonextArchitectural Performance Monitoring Extended Leaf (EAX=23h) (reserved) amx-complexAMX support for "complex" tiles (TCMMIMFP16PS and TCMMRLFP16PS)
9 (reserved) (reserved) (reserved)
10 fast_zero_rep_movsbFast zero-length MOVSB (reserved) (reserved)
11 fast_short_rep_stosbFast zero-length STOSB (reserved) (reserved)
12 fast_short_rep_cmpsb_scasbFast zero-length CMPSB and SCASB (reserved) (reserved)
13 (reserved) (reserved) (reserved)
14 (reserved) (reserved) prefetchitiPREFETCHIT0 and PREFETCHIT1 instructions
15 (reserved) (reserved) (reserved)
16 (reserved) (reserved) (reserved)
17 fredFlexible Return and Event Delivery (reserved) (reserved)
18 lkgsLKGS Instruction (reserved) cet-sssControl-Flow Enforcement Technology Supervisor Shadow Stacks
19 wrmsrnsWRMSRNS instruction (reserved) (reserved)
20 (reserved) (reserved) (reserved)
21 amx-fp16AMX instructions for FP16 numbers (reserved) (reserved)
22 hresetHRESET instruction, IA32_HRESET_ENABLE MSR, and Processor History Reset Leaf (EAX=20h) (reserved) (reserved)
23 avx-ifmaAVX IFMA instructions (reserved) (reserved)
24 (reserved) (reserved) (reserved)
25 (reserved) (reserved) (reserved)
26 lamLinear Address Masking (reserved) (reserved)
27 msrlistRDMSRLIST and WRMSRLIST instructions, and the IA32_BARRIER MSR (reserved) (reserved)
28 (reserved) (reserved) (reserved)
29 (reserved) (reserved) (reserved)
30 (reserved) (reserved) (reserved)
31 (reserved) (reserved) (reserved)


EAX=0Dh, ECX=1

EAX=0Dh,ECX=1 CPUID feature bits
BitEAX
ShortFeature
0 xsaveoptXSAVEOPT instruction
1 xsavecXSAVEC instruction
2 xgetbv_ecx1XGETBV with ECX=1 support
3 xssXSAVES and XRSTORS instructions
4 (reserved)
5 (reserved)
6 (reserved)
7 (reserved)
8 (reserved)
9 (reserved)
10 (reserved)
11 (reserved)
12 (reserved)
13 (reserved)
14 (reserved)
15 (reserved)
16 (reserved)
17 (reserved)
18 (reserved)
19 (reserved)
20 (reserved)
21 (reserved)
22 (reserved)
23 (reserved)
24 (reserved)
25 (reserved)
26 (reserved)
27 (reserved)
28 (reserved)
29 (reserved)
30 (reserved)
31 (reserved)

EAX=12h, ECX=0: SGX Leaf Functions

EAX=0Dh,ECX=1 CPUID feature bits
BitEAX
ShortFeature
0 sgx1SGX1 leaf functions
1 sgx2SGX2 leaf functions
2 (reserved)
3 (reserved)
4 (reserved)
5 ossENCLV leaves: EINCVIRTCHILD, EDECVIRTCHILD, and ESETCONTEXT
6  ?ENCLS leaves: ETRACKC, ERDINFO, ELDBC, ELDUC
7  ?ENCLU leaf: EVERIFYREPORT2
8 (reserved)
9 (reserved)
10  ?ENCLS leaf: EUPDATESVN
11  ?ENCLU leaf: EDECSSA
12 (reserved)
13 (reserved)
14 (reserved)
15 (reserved)
16 (reserved)
17 (reserved)
18 (reserved)
19 (reserved)
20 (reserved)
21 (reserved)
22 (reserved)
23 (reserved)
24 (reserved)
25 (reserved)
26 (reserved)
27 (reserved)
28 (reserved)
29 (reserved)
30 (reserved)
31 (reserved)

EAX=14h, ECX=0

EAX=14h,ECX=0 CPUID feature bits
BitEBX
ShortFeature
0 (reserved)
1 (reserved)
2 (reserved)
3 (reserved)
4 ptwrite ?
5 (reserved)
6 (reserved)
7 (reserved)
8 (reserved)
9 (reserved)
10 (reserved)
11 (reserved)
12 (reserved)
13 (reserved)
14 (reserved)
15 (reserved)
16 (reserved)
17 (reserved)
18 (reserved)
19 (reserved)
20 (reserved)
21 (reserved)
22 (reserved)
23 (reserved)
24 (reserved)
25 (reserved)
26 (reserved)
27 (reserved)
28 (reserved)
29 (reserved)
30 (reserved)
31 (reserved)


EAX=19h

EAX=19h CPUID feature bits
BitEBX
ShortFeature
0 aes_kleAES "Key Locker" Instructions
1 (reserved)
2 aes_wide_klAES "Wide Key Locker" Instructions
3 (reserved)
4 kl_msrs"Key Locker" MSRs
5 (reserved)
6 (reserved)
7 (reserved)
8 (reserved)
9 (reserved)
10 (reserved)
11 (reserved)
12 (reserved)
13 (reserved)
14 (reserved)
15 (reserved)
16 (reserved)
17 (reserved)
18 (reserved)
19 (reserved)
20 (reserved)
21 (reserved)
22 (reserved)
23 (reserved)
24 (reserved)
25 (reserved)
26 (reserved)
27 (reserved)
28 (reserved)
29 (reserved)
30 (reserved)
31 (reserved)

EAX=80000000h: Get Highest Extended Function Implemented

The highest calling parameter is returned in EAX.

EAX=80000001h: Extended Processor Info and Feature Bits

This returns extended feature flags in EDX and ECX. Bits 0 through 9, 12 through 17, 23, and 24 of EDX are duplicates of EDX from the EAX=1 leaf.

AMD feature flags are as follows:[26][27]

EAX=80000001h CPUID feature bits
BitEDXECX
ShortFeatureShortFeature
0 fpuOnboard x87 FPUlahf_lmLAHF/SAHF in long mode
1 vmeVirtual mode extensions (VIF)cmp_legacyHyperthreading not valid
2 deDebugging extensions (CR4 bit 3)svmSecure Virtual Machine
3 psePage Size ExtensionextapicExtended APIC space
4 tscTime Stamp Countercr8_legacyCR8 in 32-bit mode
5 msrModel-specific registersabmAdvanced bit manipulation (lzcnt and popcnt)
6 paePhysical Address Extensionsse4aSSE4a
7 mceMachine Check ExceptionmisalignsseMisaligned SSE mode
8 cx8CMPXCHG8 (compare-and-swap) instruction3dnowprefetchPREFETCH and PREFETCHW instructions
9 apicOnboard Advanced Programmable Interrupt ControllerosvwOS Visible Workaround
10 (reserved)ibsInstruction Based Sampling
11 syscallSYSCALL and SYSRET instructionsxopXOP instruction set
12 mtrrMemory Type Range RegistersskinitSKINIT/STGI instructions
13 pgePage Global Enable bit in CR4wdtWatchdog timer
14 mcaMachine check architecture(reserved)
15 cmovConditional move and FCMOV instructionslwpLight Weight Profiling[28]
16 patPage Attribute Tablefma44 operands fused multiply-add
17 pse3636-bit page size extensiontceTranslation Cache Extension
18 (reserved)
19 mpMultiprocessor Capablenodeid_msrNodeID MSR
20 nxNX bit(reserved)
21 (reserved)tbmTrailing Bit Manipulation
22 mmxextExtended MMXtopoextTopology Extensions
23 mmxMMX instructionsperfctr_coreCore performance counter extensions
24 fxsrFXSAVE, FXRSTOR instructions, CR4 bit 9perfctr_nbNB performance counter extensions
25 fxsr_optFXSAVE/FXRSTOR optimizations(reserved)
26 pdpe1gbGigabyte pagesdbxData breakpoint extensions
27 rdtscpRDTSCP instructionperftscPerformance TSC
28 (reserved)pcx_l2iL2I perf counter extensions
29 lmLong modemonitorxMONITORX and MWAITX instructions
30 3dnowextExtended 3DNow!addr_mask_ext ?
31 3dnow3DNow!(reserved)

EAX=80000002h,80000003h,80000004h: Processor Brand String

These return the processor brand string in EAX, EBX, ECX and EDX. CPUID must be issued with each parameter in sequence to get the entire 48-byte null-terminated ASCII processor brand string.[29] It is necessary to check whether the feature is present in the CPU by issuing CPUID with EAX = 80000000h first and checking if the returned value is not less than 80000004h.

#include <stdio.h>
#include <string.h>
#include <cpuid.h>

int main()
{
    unsigned int regs[12];
    char str[sizeof(regs)];

    __cpuid(0x80000000, regs[0], regs[1], regs[2], regs[3]);

    if (regs[0] < 0x80000004)
        return 1;

    __cpuid(0x80000002, regs[0], regs[1], regs[2], regs[3]);
    __cpuid(0x80000003, regs[4], regs[5], regs[6], regs[7]);
    __cpuid(0x80000004, regs[8], regs[9], regs[10], regs[11]);

    memcpy(str, regs, sizeof(regs));
    printf("%s\n", str);

    return 0;
}

EAX=80000005h: L1 Cache and TLB Identifiers

This function contains the processor’s L1 cache and TLB characteristics.

EAX=80000006h: Extended L2 Cache Features

Returns details of the L2 cache in ECX, including the line size in bytes (Bits 07 - 00), type of associativity (encoded by a 4 bits field; Bits 15 - 12) and the cache size in KB (Bits 31 - 16).

#include <stdio.h>
#include <cpuid.h>

int main()
{
    unsigned int eax, ebx, ecx, edx;
    unsigned int lsize, assoc, cache;

    __cpuid(0x80000006, eax, ebx, ecx, edx);
    
    lsize = ecx & 0xff;
    assoc = (ecx >> 12) & 0x07;
    cache = (ecx >> 16) & 0xffff;

    printf("Line size: %d B, Assoc. type: %d, Cache size: %d KB.\n", lsize, assoc, cache);

    return 0;
}

EAX=80000007h: Advanced Power Management Information

This function provides advanced power management feature identifiers. EDX bit 8 indicates support for invariant TSC.

EAX=80000008h: Virtual and Physical address Sizes

EAX=80000008h CPUID feature bits
BitEAXEBXECX
ShortFeature
0 Number of Physical Address BitsclzeroCLZERO instructionNumber of Physical Cores (minus 1)
1 retired_instrRetired instruction count MSR
2 xrstor_fp_errXRSTOR restores FP errors
3 invlpgbINVLPGB and TLBSYNC instructions
4 rdpruRDPRU instruction
5 (reserved)
6 mbeMemory Bandwidth Enforcement
7 (reserved)
8 Number of Linear Address BitsmcommitMCOMMIT instruction(reserved)
9 wbnoinvdWBNOINVD instruction
10 (reserved)
11 (reserved)
12 ibpbIndirect Branch Prediction Barrierlog2 of maximum APIC ID
13 wbinvd_intWBINVD and WBNOINVD are interruptible
14 ibrsIBRS (AMD)
15 single_thread_ibpSingle Thread IBP
16 (reserved)(reserved)Performance Timestamp Counter size
17 single_thread_ibp_aoAlways on Single Thread IBP
18 ibrs_preferredIBRS preferred over software(reserved)
19 ibrs_same_mode_protectionIBRS provides Same Mode Protection
20 no_efer_lmsleEFER.LMSLE is unsupported
21 invlpgb_nestedINVLPGB support for nested pages
22 (reserved)
23 ppinProtected Processor Inventory Number
24 ssbdSpeculative Store Bypass Disable
25 ssbd_legacySpeculative Store Bypass Disable Legacy
26 ssbd_noSpeculative Store Bypass Disable Not Required
27 cppcCollaborative Processor Performance Control
28 psfdPredictive Store Forward Disable
29 btc_noBranch Type Confusion: Processor not affected
30 (reserved)
31 branch_samplingBranch Sampling Support

EDX provides information specific to RDPRU (the maximum register identifier allowed) in 31-16. The current number as of Zen 2 is 1 for MPERF and APERF.

EAX=8000001Fh: Encrypted Memory Capabilities

EAX=8000001Fh CPUID feature bits
BitEAX
ShortFeature
0 smeSecure Memory Encryption
1 sevSecure Encrypted Virtualization
2 page_flushPage flush MSR
3 sev_esSEV Encrypted State
4 sev_snpSEC Secure Nested Paging
5 vmplVM Permission Levels
6 (reserved)
7 (reserved)
8 (reserved)
9 (reserved)
10 hw_cache_coherency ?
11 64_host ?
12 restricted_injection ?
13 alternate_injection ?
14 debug_swap ?
15 prevent_host_ibs ?
16 vteVirtual Transparent Encryption
17 (reserved)
18 (reserved)
19 (reserved)
20 (reserved)
21 (reserved)
22 (reserved)
23 (reserved)
24 (reserved)
25 (reserved)
26 (reserved)
27 (reserved)
28 (reserved)
29 (reserved)
30 (reserved)
31 (reserved)

EAX=80000021h: Extended Feature Identification 2

EAX=80000021h CPUID feature bits
BitEAX
ShortFeature
0 NoNestedDataBpProcessor ignores nested data breakpoints
1 (reserved)
2 LFenceAlwaysSerializingLFENCE is always dispatch serializing
3 SmmPgCfgLockSMM paging configuration lock supported
4 (reserved)
5 (reserved)
6 NullSelectClearsBaseNull segment selector loads also clear the destination segment register base and limit
7 UpperAddressIgnoreUpper Address Ignore is supported
8 AutomaticIBRSAutomatic IBRS
9 NoSmmCtlMSRSMM_CTL MSR (C0010116h) is not supported
10 (reserved)
11 (reserved)
12 (reserved)
13 PrefetchCtlMsrPrefetchControl MSR (C0000108h) is supported
14 (reserved)
15 (reserved)
16 (reserved)
17 CpuidUserDisCPUID disable for non-privileged software
31:18 (reserved)
EAX=80000021h CPUID feature bits
BitEBX
ShortFeature
11:0 MicrocodePatchSizeThe size of the Microcode patch in 16-byte multiples. If 0, the size of the patch is at most 5568 (15C0h) bytes
31:12 (reserved)

EAX=8FFFFFFFh: AMD Easter Egg

Several AMD CPU models will, for CPUID with EAX=8FFFFFFFh, return an Easter Egg string in EAX, EBX, ECX and EDX.[30][31] Known Easter Egg strings include:

ProcessorString
AMD K6NexGenerationAMD
AMD K8IT'S HAMMER TIME
AMD Jaguar[32]HELLO KITTY! ^-^

CPUID usage from high-level languages

Inline assembly

This information is easy to access from other languages as well. For instance, the C code for gcc below prints the first five values, returned by the cpuid:

#include <stdio.h>
#include <cpuid.h>

int main()
{
    unsigned int i, eax, ebx, ecx, edx;

    for (i = 0; i < 5; i++) {
        __cpuid(i, eax, ebx, ecx, edx);
        printf ("InfoType %x\nEAX: %x\nEBX: %x\nECX: %x\nEDX: %x\n", i, eax, ebx, ecx, edx);
    }

    return 0;
}

In MSVC and Borland/Embarcadero C compilers (bcc32) flavored inline assembly, the clobbering information is implicit in the instructions:

#include <stdio.h>

int main()
{
    unsigned int a, b, c, d, i = 0;

    __asm {
        /* Do the call. */
        mov EAX, i;
        cpuid;
        /* Save results. */
        mov a, EAX;
        mov b, EBX;
        mov c, ECX;
        mov d, EDX;
    }

    printf ("InfoType %x\nEAX: %x\nEBX: %x\nECX: %x\nEDX: %x\n", i, a, b, c, d);
    return 0;
}

If either version was written in plain assembly language, the programmer must manually save the results of EAX, EBX, ECX, and EDX elsewhere if they want to keep using the values.

Wrapper functions

GCC also provides a header called <cpuid.h> on systems that have CPUID. The __cpuid is a macro expanding to inline assembly. Typical usage would be:

#include <stdio.h>
#include <cpuid.h>

int main()
{
    unsigned int eax, ebx, ecx, edx;

    __cpuid(0 /* vendor string */, eax, ebx, ecx, edx);
    printf("EAX: %x\nEBX: %x\nECX: %x\nEDX: %x\n", eax, ebx, ecx, edx);

    return 0;
}

But if one requested an extended feature not present on this CPU, they would not notice and might get random, unexpected results. Safer version is also provided in <cpuid.h>. It checks for extended features and does some more safety checks. The output values are not passed using reference-like macro parameters, but more conventional pointers.

#include <stdio.h>
#include <cpuid.h>

int main()
{
    unsigned int eax, ebx, ecx, edx;

    /* 0x81234567 is nonexistent, but assume it exists */
    if (!__get_cpuid (0x81234567, &eax, &ebx, &ecx, &edx)) {
        printf("Warning: CPUID request 0x81234567 not valid!\n");
        return 1;
    }

    printf("EAX: %x\nEBX: %x\nECX: %x\nEDX: %x\n", eax, ebx, ecx, edx);

    return 0;
}

Notice the ampersands in &a, &b, &c, &d and the conditional statement. If the __get_cpuid call receives a correct request, it will return a non-zero value, if it fails, zero.[33]

Microsoft Visual C compiler has builtin function __cpuid() so the cpuid instruction may be embedded without using inline assembly, which is handy since the x86-64 version of MSVC does not allow inline assembly at all. The same program for MSVC would be:

#include <stdio.h>
#ifdef __MSVC__
    #include <intrin.h>
#endif

int main()
{
    unsigned int regs[4];
    int i;

    for (i = 0; i < 4; i++) {
        __cpuid(regs, i);
        printf("The code %d gives %d, %d, %d, %d", regs[0], regs[1], regs[2], regs[3]);
    }

    return 0;
}

Many interpreted or compiled scripting languages are capable of using CPUID via an FFI library. One such implementation shows usage of the Ruby FFI module to execute assembly language that includes the CPUID opcode.

.NET 5 and later versions provide the System.Runtime.Intrinsics.X86.X86base.CpuId method. For instance, the C# code below prints the processor brand if it supports CPUID instruction:

using System.Runtime.InteropServices;
using System.Runtime.Intrinsics.X86;
using System.Text;

namespace X86CPUID {
    class CPUBrandString {
        public static void Main(string[] args) {
            if (!X86Base.IsSupported) {
                Console.WriteLine("Your CPU does not support CPUID instruction.");
            } else {
                Span<int> raw = stackalloc int[12];
                (raw[0], raw[1], raw[2],  raw[3])  = X86Base.CpuId(unchecked((int)0x80000002), 0);
                (raw[4], raw[5], raw[6],  raw[7])  = X86Base.CpuId(unchecked((int)0x80000003), 0);
                (raw[8], raw[9], raw[10], raw[11]) = X86Base.CpuId(unchecked((int)0x80000004), 0);

                Span<byte> bytes = MemoryMarshal.AsBytes(raw);
                string brand = Encoding.UTF8.GetString(bytes).Trim();
                Console.WriteLine(brand);
            }
        }
    }
}

CPU-specific information outside x86

Some of the non-x86 CPU architectures also provide certain forms of structured information about the processor's abilities, commonly as a set of special registers:

  • ARM architectures have a CPUIDcoprocessor register which requires EL1 or above to access.[34]
  • The IBM System z mainframe processors have a Store CPU ID (STIDP) instruction since the 1983 IBM 4381[35] for querying the processor ID.[36]
  • The IBM System z mainframe processors also have a Store Facilities List Extended (STFLE) instruction which lists the installed hardware features.[36]
  • The MIPS32/64 architecture defines a mandatory Processor Identification (PrId) and a series of daisy-chained Configuration Registers.[37]
  • The PowerPC processor has the 32-bit read-only Processor Version Register (PVR) identifying the processor model in use. The instruction requires supervisor access level.[38]

DSP and transputer-like chip families have not taken up the instruction in any noticeable way, in spite of having (in relative terms) as many variations in design. Alternate ways of silicon identification might be present; for example, DSPs from Texas Instruments contain a memory-based register set for each functional unit that starts with identifiers determining the unit type and model, its ASIC design revision and features selected at the design phase, and continues with unit-specific control and data registers. Access to these areas is performed by simply using the existing load and store instructions; thus, for such devices, there is no need for extending the register set for device identification purposes.

See also

References

  1. "Intel 64 and IA-32 Architectures Software Developer's Manual" (PDF). Intel.com. Retrieved 2013-04-11.
  2. "Detecting Intel Processors - Knowing the generation of a system CPU". Rcollins.org. Retrieved 2013-04-11.
  3. "LXR linux-old/arch/i386/kernel/head.S". Lxr.linux.no. Archived from the original on 2012-07-13. Retrieved 2013-04-11.
  4. "CPUID, EAX=4 - Strange results (Solved)". Software.intel.com. Retrieved 2014-07-10.
  5. instlatx64, CPUID dump for RDC IAD 100. Retrieved 22 December 2022.
  6. "ao486 CPUID instruction". GitHub. 12 March 2022.
  7. "v586: 586 compatible soft core for FPGA". GitHub. 6 December 2021.
  8. "Steam Hardware & Software Survey". store.steampowered.com. Retrieved 2022-07-26.
  9. "Fun with Timers and cpuid - by Jim Cownie - CPU fun". 3 March 2021.
  10. "Chapter 3 Instruction Set Reference, A-L" (PDF). Intel® 64 and IA-32 Architectures Software Developer's Manual. Intel Corporation. 2018-12-20. Retrieved 2018-12-20.
  11. http://bochs.sourceforge.net/techspec/24161821.pdf
  12. Huggahalli, Ram; Iyer, Ravi; Tetrick, Scott (2005). "Direct Cache Access for High Bandwidth Network I/O". ACM SIGARCH Computer Architecture News. 33 (2): 50–59. doi:10.1145/1080695.1069976. CiteSeerX:10.1.1.91.957.
  13. Drepper, Ulrich (2007), What Every Programmer Should Know About Memory, CiteSeerX:10.1.1.91.957
  14. "Mechanisms to determine if software is running in a VMware virtual machine". VMware Knowledge Base. VMWare. 2015-05-01. Intel and AMD CPUs have reserved bit 31 of ECX of CPUID leaf 0x1 as the hypervisor present bit. This bit allows hypervisors to indicate their presence to the guest operating system. Hypervisors set this bit and physical CPUs (all existing and future CPUs) set this bit to zero. Guest operating systems can test bit 31 to detect if they are running inside a virtual machine.
  15. Kataria, Alok; Hecht, Dan (2008-10-01). "Hypervisor CPUID Interface Proposal". LKML Archive on lore.kernel.org. Archived from the original on 2019-03-15. Bit 31 of ECX of CPUID leaf 0x1. This bit has been reserved by Intel & AMD for use by hypervisors and indicates the presence of a hypervisor. Virtual CPU's (hypervisors) set this bit to 1 and physical CPU's (all existing and future CPU's) set this bit to zero. This bit can be probed by the guest software to detect whether they are running inside a virtual machine.
  16. Shih Kuo (Jan 27, 2012). "Intel® 64 Architecture Processor Topology Enumeration".
  17. "Processor and Core Enumeration Using CPUID | AMD". Developer.amd.com. Archived from the original on 2014-07-14. Retrieved 2014-07-10.
  18. "Sandybridge processors report incorrect core number?". Software.intel.com. 2012-12-29. Retrieved 2014-07-10.
  19. "cpuid, __cpuidex". Msdn.microsoft.com. 2014-06-20. Retrieved 2014-07-10.
  20. "x86 architecture - CPUID". sandpile.org. Retrieved 2014-07-10.
  21. "topology.cpp in ps/trunk/source/lib/sysdep/arch/x86_x64 – Wildfire Games". Trac.wildfiregames.com. 2011-12-27. Retrieved 2014-07-10.
  22. Hyper-Threading Technology and Multi-Core Processor Detection
  23. https://software.intel.com/content/www/us/en/develop/download/intel-avx512-fp16-architecture-specification.html
  24. "Speculative Execution Side Channel Mitigations" (PDF). Revision 2.0. Intel. May 2018 [January 2018]. Document Number: 336996-002. Retrieved 2018-05-26.
  25. "IBRS patch series [LWN.net]".
  26. CPUID Specification (PDF), AMD, September 2010, retrieved 2013-04-02
  27. Linux kernel source code
  28. Lightweight Profiling Specification (PDF), AMD, August 2010, retrieved 2013-04-03
  29. "Intel® Processor Identification and the CPUID Instruction" (PDF). Download.intel.com. 2012-03-06. Retrieved 2013-04-11.
  30. Ferrie, Peter. "Attacks on Virtual Machine Emulators" (PDF). symantec.com. Symantec Advanced Threat Research. Archived from the original (PDF) on 2007-02-07. Retrieved 15 March 2017.
  31. Sandpile, x86 architecture CPUID. Retrieved 22 December 2022.
  32. instlatx64, CPUID dump of AMD A4-5000, lists "HELLO KITTY" string for CPUID leaf 8FFFFFFFh. Retrieved 22 December 2022.
  33. "GCC-mirror/GCC". GitHub. 13 March 2022.
  34. "ARM Information Center". Infocenter.arm.com. Retrieved 2013-04-11.
  35. "Processor version codes and SRM constants". Archived from the original on 2014-09-08. Retrieved 2014-09-08.
  36. "IBM System z10 Enterprise Class Technical Guide" (PDF).
  37. "MIPS32 Architecture For Programmers, Volume III: The MIPS32 Privileged Resource Architecture" (PDF). MIPS Technologies, Inc. 2001-03-12.
  38. "PowerPC Operating Environment Architecture, book III" (PDF).

Further reading

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.