Ascend
Member
Not really. Their SM in Ampere can do either 128 FP32 OR 64 FP32 + 64 INT32. The only time you will reach the advertised TF is when there are ZERO INT32 operations. At any other time, the TF is literally cut in half for the SM doing the INT operation.They are just reporting the basic mathematics, as does everyone.
What is left to be seen is how efficiently the architecture utilizes the stream processors. Historically, nVidia has had the edge in this area with AMD having higher sp counts (and TF numbers) but being on the short end of performance.
Last edited: