Alibaba Cloud drops a “bombshell”!
On May 21, Alibaba Cloud dropped a bombshell: the API input price of Qwen-Long, the GPT-4-class flagship model of Tongyi Qianwen, fell from 0.02 yuan per 1,000 tokens to 0.0005 yuan per 1,000 tokens, a cut of 97%. At the new rate, 1 yuan buys 2 million tokens, roughly the amount of text in five copies of the Xinhua Dictionary. The model accepts long-text input of up to 10 million tokens, and after the cut its price is about 1/400 that of GPT-4, undercutting the lowest price on the global market.
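The figures above can be checked with a few lines of arithmetic. The sketch below uses only the two prices quoted in the article; the function names are illustrative and are not part of any Alibaba Cloud API.

```python
from decimal import Decimal

# Prices quoted in the article, in yuan per 1,000 input tokens.
OLD_PRICE = Decimal("0.02")
NEW_PRICE = Decimal("0.0005")


def percent_drop(old: Decimal, new: Decimal) -> Decimal:
    """Return the price reduction as a percentage of the old price."""
    return (old - new) / old * 100


def tokens_per_yuan(price_per_thousand: Decimal) -> int:
    """Return how many tokens 1 yuan buys at the given per-1,000-token price."""
    return int(Decimal(1000) / price_per_thousand)


drop = percent_drop(OLD_PRICE, NEW_PRICE)
tokens = tokens_per_yuan(NEW_PRICE)

print(f"Price reduction: {drop}%")
print(f"Tokens per yuan: {tokens:,}")
```

The exact reduction works out to 97.5%, which the article reports as 97%, and 1 yuan does buy 2,000,000 tokens at the new price.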
In response, a Volcano Engine executive told a reporter from Yicai (China Business Network) exclusively that the company very much welcomes the price cut for the Tongyi large model, and that the two can jointly help enterprises explore AI transformation at lower cost and speed up the deployment of large-model applications. Reportedly, alongside its own substantial price cut, the Doubao large model also offers customers the industry's highest TPM (tokens per minute) and RPM (requests per minute) limits. Its per-minute token quota is several times that of comparable models in the industry, which lets it support a large number of concurrent requests and helps enterprises call the large model from production systems.