1. The Development of Chinese Proofreading Software
In 1993, we began developing version 1.0 of "Verse Text Proofreading." In April 1994, we produced a demonstration disk and took part in the "Beijing Fair." During the six-day exhibition, we sold more than 60 sets of the proofreading software.
After this first victory, proofreading software became the company's most important product. However, even though we had seized the initiative, in early 1995 we ran into the dilemma that many new technology companies face: seemingly overnight, seven or eight other companies released Chinese proofreading software of their own. After arduous effort and six rounds of version improvements, Dark Horse text proofreading finally gained a firm foothold in the market.
Building on the traditional binary search method, we used a special technique to convert the knowledge base and build an index, which saves 50% to 66% of the storage space; with less data to read, operating speed can increase two- to threefold. We also added a search filter that greatly reduces the number of hard disk queries, cutting query volume by up to 80%.
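The original does not describe how the search filter works. As a rough sketch of the general idea only, the example below assumes a Bloom-style bit filter held in memory in front of a sorted entry list that stands in for the on-disk knowledge base. The names SearchFilter, KnowledgeBase, and disk_queries are hypothetical, and this is not the Dark Horse implementation.

```python
import bisect
import hashlib


class SearchFilter:
    """In-memory bit filter (an assumed Bloom-style design).

    It answers either "definitely absent" or "possibly present".
    """

    def __init__(self, num_bits=1 << 16, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits // 8)

    def _positions(self, key):
        # Derive several bit positions per key from salted SHA-256 digests.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key):
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(key)
        )


class KnowledgeBase:
    """Sorted entry list standing in for the on-disk knowledge base."""

    def __init__(self, entries):
        self.entries = sorted(set(entries))
        self.filter = SearchFilter()
        for entry in self.entries:
            self.filter.add(entry)
        self.disk_queries = 0  # counts lookups that get past the filter

    def contains(self, key):
        # The filter rejects most absent keys without touching the "disk".
        if not self.filter.might_contain(key):
            return False
        self.disk_queries += 1
        # Traditional binary search over the sorted entries.
        i = bisect.bisect_left(self.entries, key)
        return i < len(self.entries) and self.entries[i] == key
```

A filter of this kind never rejects an entry that is actually stored, so it only removes wasted lookups. How much query volume it saves depends on how often the looked-up strings are absent from the knowledge base, so the 80% figure above is the authors' measurement, not something this sketch reproduces.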
With these techniques in place, we continued to build up and expand the knowledge base, on the one hand collecting corpus material by various means and on the other strengthening our corpus processing capabilities. By 1998, we had accumulated a corpus of more than one billion Chinese characters of various kinds.
In 1996, we began developing a Word version of the Dark Horse proofreading software, which proofreads documents directly as they are edited in Word. However, because of certain limitations, it is difficult to achieve the functions required for professional proofreading inside Word. We therefore resolved to develop a full 32-bit editing and proofreading program, which is now "Edited 98."
The Chinese Windows version of the proofreading software had to solve the problem of handling file formats. PS is, relatively speaking, an international standard format; for S2 files we had already done text conversion; and although the Word file format is not open, Word files can fortunately be saved in RTF format, which gave us a solution. In the era of Windows 95/98, Chinese proofreading software finally withstood the challenge and the test.
2. The Future of Chinese Proofreading
On the one hand, proofreading should develop in the direction of specialization; on the other hand, as proofreading functions gradually mature, we are also working hard to popularize them, especially by bringing them into mainstream typesetting systems and mainstream word processing software. We have already cooperated with IBM, specifically on Lotus WordPro; we have also worked with Lenovo's Chinese system to add proofreading capabilities to Lenovo Office.
With the rise of the Internet, we can also use it to offer proofreading services. We launched an online proofreading service in 1997. Users only need to email us the documents they want proofread; after running them through the proofreading software, we send the results back to the user. We charge no fee at any point in the process. After more than a year of operation, we have more than 80 users.
From the perspective of proofreading, a corpus currently needs to contain at least 6.4 billion to 40 billion Chinese characters. Collecting, sorting, processing, and applying such a large corpus requires a huge investment, and it would be a waste to use it only for text proofreading. Moreover, proofreading is based on contextual analysis, which is also the foundation of many other kinds of software, such as intelligent keyboard input, OCR post-processing, and voice processing. We therefore hope to establish a shared corpus in some form and to develop Chinese proofreading into a sharp tool for information processing. That is the greatest wish of the Dark Horse team.