CN110852091A - Method and device for monitoring wrongly written characters, electronic equipment and computer readable medium - Google Patents

Method and device for monitoring wrongly written characters, electronic equipment and computer readable medium Download PDF

Info

Publication number
CN110852091A
CN110852091A CN201911097666.2A CN201911097666A CN110852091A CN 110852091 A CN110852091 A CN 110852091A CN 201911097666 A CN201911097666 A CN 201911097666A CN 110852091 A CN110852091 A CN 110852091A
Authority
CN
China
Prior art keywords
typo
keyword
initial
error level
library
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911097666.2A
Other languages
Chinese (zh)
Other versions
CN110852091B (en
Inventor
蔡建科
范渊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Dbappsecurity Technology Co Ltd
Original Assignee
Hangzhou Dbappsecurity Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Dbappsecurity Technology Co Ltd filed Critical Hangzhou Dbappsecurity Technology Co Ltd
Priority to CN201911097666.2A priority Critical patent/CN110852091B/en
Publication of CN110852091A publication Critical patent/CN110852091A/en
Application granted granted Critical
Publication of CN110852091B publication Critical patent/CN110852091B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Machine Translation (AREA)

Abstract

本发明提供了一种错别字的监测方法、装置、电子设备和计算机可读介质,涉及网络安全的技术领域,包括:获取待检测网页中的文字;基于关键字字库和忽略字库,确定文字中是否存在错别字;基于关键字字库中的关键字的错误等级,确定出错别字的错误等级;若错别字的错误等级高于或等于预设等级,则将错别字发送给服务器,以使服务器基于错别字向用户发送告警信息,解决了现有技术中错别字的识别误报率较高的技术问题。

Figure 201911097666

The present invention provides a method, device, electronic device and computer-readable medium for monitoring typos, and relates to the technical field of network security, including: acquiring text in a webpage to be detected; There is a typo; based on the error level of the keyword in the keyword font, the error level of the typo is determined; if the error level of the typo is higher than or equal to the preset level, the typo is sent to the server, so that the server sends the user based on the typo to the user. The alarm information solves the technical problem of a high false alarm rate in the prior art for identifying typos.

Figure 201911097666

Description

错别字的监测方法、装置、电子设备和计算机可读介质Method, apparatus, electronic device, and computer-readable medium for detecting typos

技术领域technical field

本发明涉及网络安全技术领域,尤其是涉及一种错别字的监测方法、装置、电子设备和计算机可读介质。The present invention relates to the technical field of network security, and in particular, to a method, device, electronic device and computer-readable medium for detecting typos.

背景技术Background technique

我国已经进入信息化时代,信息化给我们带来便利的同时,也给我们带来了烦恼。网站或公文中存在严重的文字错误,很容易被认为工作态度有问题。在网民、媒体广泛关注的互联网时代,一个严重的文字错误极易被网民、媒体利用,成为炒作的题材,其负面影响不次于一次网站被黑。所以,提高网站错别字监测告警技术就成为我们在信息化时代面临的重要任务。但是如何准确高效的确定出网页中的错别字成为了一种亟待解决的问题。Our country has entered the age of informationization. While informationization brings us convenience, it also brings us troubles. Serious typographical errors in a website or official document can easily be seen as a problem with work attitude. In the Internet age where netizens and the media are widely concerned, a serious word error can easily be exploited by netizens and the media and become the subject of speculation, and its negative impact is no less than a website hack. Therefore, improving the website typo detection and warning technology has become an important task we face in the information age. However, how to accurately and efficiently determine the typos in web pages has become an urgent problem to be solved.

针对上述问题,还未提出有效的解决方案。For the above problems, no effective solutions have been proposed yet.

发明内容SUMMARY OF THE INVENTION

有鉴于此,本发明的目的在于提供一种错别字的监测方法、装置、电子设备和计算机可读介质,以缓解了现有技术中错别字的识别误报率较高的技术问题。In view of this, the purpose of the present invention is to provide a method, device, electronic device and computer readable medium for detecting typos, so as to alleviate the technical problem of high false alarm rate of typos in the prior art.

第一方面,本发明实施例提供了一种错别字的监测方法,应用于云平台,包括:获取待检测网页中的文字;基于关键字字库和忽略字库,确定所述文字中是否存在错别字;基于所述关键字字库中的关键字的错误等级,确定出所述错别字的错误等级;若所述错别字的错误等级高于或等于预设等级,则将所述错别字发送给服务器,以使所述服务器基于所述错别字向用户发送告警信息。In a first aspect, an embodiment of the present invention provides a method for monitoring typos, which is applied to a cloud platform, including: acquiring text in a webpage to be detected; The error level of the keywords in the keyword font library determines the error level of the typo; if the error level of the typo is higher than or equal to a preset level, the typo is sent to the server, so that the The server sends alert information to the user based on the typo.

进一步地,基于关键字字库和忽略字库,确定所述文字中是否存在错别字,包括:将所述文字与所述关键字字库中的关键字进行匹配,确定出所述文字中的初始错别字,其中,所述关键字字库中的关键字包括:所述云平台预设的关键字,所述用户设定的自定义关键字;对比所述初始错别字与所述忽略字库中的关键字,判断所述忽略字库中的关键字是否包含所述初始错别字;若判断出所述忽略字库中的关键字包含所述初始错别字,则确定所述初始错别字为所述错别字。Further, determining whether there is a typo in the text based on the keyword font library and the ignored font library includes: matching the text with the keywords in the keyword font library, and determining the initial typo in the text, wherein , the keywords in the keyword font library include: keywords preset by the cloud platform, custom keywords set by the user; whether the keyword in the ignored word library contains the initial typo; if it is determined that the keyword in the ignored word library contains the initial typo, then the initial typo is determined to be the typo.

进一步地,所述方法还包括:若判断出所述忽略字库中的关键字不包含所述初始错别字,则确定所述初始错别字不是所述错别字。Further, the method further includes: if it is determined that the keyword in the ignored word library does not contain the initial typo, determining that the initial typo is not the typo.

进一步地,所述方法还包括:若所述错别字的错误等级低于所述预设等级,则基于所述错别字生成错别字报表。Further, the method further includes: if the error level of the typo is lower than the preset level, generating a typo report based on the typo.

进一步地,基于所述关键字字库中的关键字的错误等级,确定出所述错别字的错误等级,包括:将与所述错别字相匹配的关键字字库中的关键字的错误等级确定为所述错别字的错误等级,其中,所述错误等级用于表征所述错别字的危险程度。Further, determining the error level of the misspelled word based on the error level of the keyword in the keyword font library, comprising: determining the error level of the keyword in the keyword font library matching the misspelled word as the The error level of the typo, wherein the error level is used to characterize the degree of danger of the typo.

第二方面,本发明实施例提供了一种错别字的监测装置,用于云平台,包括:获取单元,第一确定单元,第二确定单元和发送单元,其中,所述获取单元用于获取待检测网页中的文字;所述第一确定单元用于基于关键字字库和忽略字库,确定所述文字中是否存在错别字;所述第二确定单元用于基于所述关键字字库中的关键字的错误等级,确定出所述错别字的错误等级,其中,所述错误等级用于表征所述错别字的危险程度;所述发送单元用于若所述错别字的错误等级高于或等于预设等级,则将所述错别字发送给服务器,以使所述服务器基于所述错别字向用户发送告警信息。In a second aspect, an embodiment of the present invention provides a device for monitoring typos, which is used in a cloud platform, and includes: an acquisition unit, a first determination unit, a second determination unit, and a sending unit, wherein the acquisition unit is used to acquire a waiting unit. Detect the text in the webpage; the first determining unit is used for determining whether there is a typo in the text based on the keyword font library and the ignore font library; the second determining unit is used for determining whether there is a typo in the keyword font Error level, to determine the error level of the typo, wherein the error level is used to represent the danger level of the typo; the sending unit is used for if the error level of the typo is higher than or equal to a preset level, then The typo is sent to the server, so that the server sends alert information to the user based on the typo.

进一步地,所述第一确定单元还用于:将所述文字与所述关键字字库中的关键字进行匹配,确定出所述文字中的初始错别字,其中,所述关键字字库中的关键字包括:所述云平台预设的关键字,所述用户设定的自定义关键字;对比所述初始错别字与所述忽略字库中的关键字,判断所述忽略字库中的关键字是否包含所述初始错别字;若判断出所述忽略字库中的关键字包含所述初始错别字,则确定所述初始错别字为所述错别字。Further, the first determining unit is further configured to: match the text with the keywords in the keyword word library, and determine the initial typo in the text, wherein the key words in the keyword word library are Words include: keywords preset by the cloud platform, custom keywords set by the user; comparing the initial typos with the keywords in the ignored word library, and judging whether the keywords in the ignored word library contain the initial typo; if it is determined that the keyword in the ignored word library contains the initial typo, the initial typo is determined to be the typo.

进一步地,所述第一确定单元还用于:若判断出所述忽略字库中的关键字不包含所述初始错别字,则确定所述初始错别字不是所述错别字。Further, the first determining unit is further configured to: determine that the initial typo is not the typo if it is determined that the keyword in the ignored word library does not contain the initial typo.

第三方面,本发明实施例还提供了一种电子设备,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现上述第一方面中任一项所述的方法的步骤。In a third aspect, an embodiment of the present invention further provides an electronic device, including a memory, a processor, and a computer program stored on the memory and executable on the processor, where the processor executes the computer program When implementing the steps of the method in any one of the above first aspects.

第四方面,本发明实施例还提供了一种具有处理器可执行的非易失的程序代码的计算机可读介质,所述程序代码使所述处理器执行上述第一方面中任一所述方法。In a fourth aspect, an embodiment of the present invention further provides a computer-readable medium having a processor-executable non-volatile program code, where the program code enables the processor to execute any one of the foregoing first aspects. method.

在本发明实施例中,获取待检测网页中的文字;基于关键字字库和忽略字库,确定文字中是否存在错别字;基于关键字字库中的关键字的错误等级,确定出错别字的错误等级,其中,错误等级用于表征错别字的危险程度;若错别字的错误等级高于或等于预设等级,则将错别字发送给服务器,以使服务器基于错别字向用户发送告警信息,达到了降低了待检测网页中的错别字的识别误报率,以及对待检测网页中的错别字进行告警的目的,进而解决了现有技术中错别字的识别误报率较高的技术问题,从而实现了降低了待检测网页中的错别字的识别误报率的技术效果。In the embodiment of the present invention, the text in the webpage to be detected is obtained; based on the keyword font library and the ignored font library, it is determined whether there is a typo in the text; based on the error level of the keywords in the keyword font library, the error level of the typo word is determined, wherein , the error level is used to represent the danger level of the typo; if the error level of the typo is higher than or equal to the preset level, the typo will be sent to the server, so that the server will send an alarm message to the user based on the typo, which reduces the number of pages to be detected. and the purpose of alerting the typo in the web page to be detected, thereby solving the technical problem of the high recognition false alarm rate of the typo in the prior art, thereby reducing the number of typos in the web page to be detected. The technical effect of identifying the false positive rate.

本发明的其他特征和优点将在随后的说明书中阐述,并且,部分地从说明书中变得显而易见,或者通过实施本发明而了解。本发明的目的和其他优点在说明书、权利要求书以及附图中所特别指出的结构来实现和获得。Other features and advantages of the present invention will be set forth in the description which follows, and in part will be apparent from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the description, claims and drawings.

为使本发明的上述目的、特征和优点能更明显易懂,下文特举较佳实施例,并配合所附附图,作详细说明如下。In order to make the above-mentioned objects, features and advantages of the present invention more obvious and easy to understand, preferred embodiments are given below, and are described in detail as follows in conjunction with the accompanying drawings.

附图说明Description of drawings

为了更清楚地说明本发明具体实施方式或现有技术中的技术方案,下面将对具体实施方式或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本发明的一些实施方式,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the specific embodiments of the present invention or the technical solutions in the prior art more clearly, the following briefly introduces the accompanying drawings that need to be used in the description of the specific embodiments or the prior art. Obviously, the accompanying drawings in the following description The drawings are some embodiments of the present invention. For those of ordinary skill in the art, other drawings can also be obtained based on these drawings without creative efforts.

图1为本发明实施例提供的一种错别字的监测方法的流程图;1 is a flowchart of a method for monitoring a typo provided in an embodiment of the present invention;

图2为本发明实施例提供的一种错别字的确定方法的流程图;2 is a flowchart of a method for determining a typo provided by an embodiment of the present invention;

图3为本发明实施例提供的一种错别字的监测装置的示意图;3 is a schematic diagram of a device for monitoring typos provided by an embodiment of the present invention;

图4为本发明实施例提供的一种服务器的示意图。FIG. 4 is a schematic diagram of a server according to an embodiment of the present invention.

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合附图对本发明的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings. Obviously, the described embodiments are part of the embodiments of the present invention, but not all of them. example. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

在现有技术中,由于早期对网站错别字监测告警的不够重视,仅仅只能识别网站错别字,存在各方面的误报,无法满足网站管理人员要求。目前的技术,识别出来的错别字很多是已经通用或者混用的词语,用户不认为是错误的。同时目前的错别字监测结果,由于没有好的告警方式,管理人员无法实时掌握,快速发现,迅速修改。In the prior art, due to insufficient attention paid to website typo monitoring alarms in the early stage, only website typos can be identified, and there are false positives in various aspects, which cannot meet the requirements of website administrators. With the current technology, many of the identified typos are words that have been commonly used or mixed, and users do not think they are wrong. At the same time, due to the lack of a good warning method for the current typo monitoring results, managers cannot grasp it in real time, find it quickly, and modify it quickly.

网站错别字即网站的内容中存在文字错误。网站错别字问题由来已久,既有人为录入错误,也有大量使用O C R(Optical Character Recognition,光学字符识别)扫描设备形成的O C R识别错误。根据权威机构统计,目前中国网站平均存在错别字在5000个以上,错别字发生率在万分之八以上,而且随着O C R扫描设备的大量装备和发稿量激增,网站错别字发生率将呈快速增长趋势。Website typos are textual errors in the content of a website. The problem of typos on websites has a long history, including human input errors and OCR recognition errors caused by a large number of OCR (Optical Character Recognition, Optical Character Recognition) scanning equipment. According to statistics from authoritative organizations, there are currently more than 5,000 typos on Chinese websites on average, and the incidence of typos is more than 8 in 10,000. Moreover, with the large number of OCR scanning equipment and the surge in the number of publications, the incidence of typos on websites will show a rapid growth trend.

针对上述问题,提出以下方法,用于解决上述问题。In view of the above problems, the following methods are proposed to solve the above problems.

实施例一:Example 1:

根据本发明实施例,提供了一种错别字的监测方法的实施例,需要说明的是,在附图的流程图示出的步骤可以在诸如一组计算机可执行指令的计算机系统中执行,并且,虽然在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤。According to an embodiment of the present invention, an embodiment of a method for detecting typos is provided. It should be noted that the steps shown in the flowchart of the accompanying drawings may be executed in a computer system such as a set of computer-executable instructions, and, Although a logical order is shown in the flowcharts, in some cases steps shown or described may be performed in an order different from that herein.

图1是根据本发明实施例的一种错别字的监测方法的流程图,如图1所示,该方法包括如下步骤:Fig. 1 is a flow chart of a method for monitoring typos according to an embodiment of the present invention. As shown in Fig. 1, the method comprises the following steps:

步骤S102,获取待检测网页中的文字;Step S102, acquiring the text in the webpage to be detected;

步骤S104,基于关键字字库和忽略字库,确定所述文字中是否存在错别字;Step S104, based on the keyword font library and the ignored font library, determine whether there is a typo in the text;

步骤S106,基于所述关键字字库中的关键字的错误等级,确定出所述错别字的错误等级;Step S106, determining the error level of the typo based on the error level of the keyword in the keyword word library;

步骤S108,若所述错别字的错误等级高于或等于预设等级,则将所述错别字发送给服务器,以使所述服务器基于所述错别字向用户发送告警信息。Step S108, if the error level of the typo is higher than or equal to a preset level, send the typo to the server, so that the server sends an alarm message to the user based on the typo.

在本发明实施例中,通过获取待检测网页中的文字;基于关键字字库和忽略字库,确定文字中是否存在错别字;基于关键字字库中的关键字的错误等级,确定出错别字的错误等级,其中,错误等级用于表征错别字的危险程度;若错别字的错误等级高于或等于预设等级,则将错别字发送给服务器,以使服务器基于错别字向用户发送告警信息,达到了降低了待检测网页中的错别字的识别误报率,以及对待检测网页中的错别字进行告警的目的,进而解决了现有技术中错别字的识别误报率较高的技术问题,从而实现了降低了待检测网页中的错别字的识别误报率的技术效果。In the embodiment of the present invention, by acquiring the text in the webpage to be detected; determining whether there are typos in the text based on the keyword font library and the ignore font library; Among them, the error level is used to represent the danger level of the typo; if the error level of the typo is higher than or equal to the preset level, the typo will be sent to the server, so that the server will send an alarm message to the user based on the typo, which reduces the number of pages to be detected. The recognition false alarm rate of the typos in the web page, and the purpose of alarming the typos in the web page to be detected, further solves the technical problem of the high recognition false alarm rate of the typos in the prior art, thereby reducing the number of typos in the web page to be detected. The technical effect of typo recognition false positive rate.

需要说明的是,上述的云平台是用于基于远程监测技术可以有效的监测到网站错别字,同时可以自定义设置关键字、自定义忽略库配置和告警联系人的网站监测云计算平台。同时能对网站错别字进行短信、邮件等形式的告警。It should be noted that the above cloud platform is a website monitoring cloud computing platform that can effectively monitor website typos based on remote monitoring technology, and can customize keywords, custom ignore library configuration, and alert contacts. At the same time, it can send alerts in the form of text messages, emails and other forms of website typos.

另外,还需要说明的是,上述获取待检测网页中的文字可以采用爬虫软件爬取文字的方法。In addition, it should also be noted that the above-mentioned method for acquiring the text in the webpage to be detected may be a method of crawling text by crawler software.

另外,向用户发送告警信息可以根据用户提前设置好的告警时间段及告警方式进行判断,若在时间段内则根据不同的告警方式调用短信、邮件、电话等接口对用户进行告警,从而使用户能够及时的获取到告警信息,并对错别字进行修改,提高了用户的工作效率。In addition, the alarm information sent to the user can be judged according to the alarm time period and the alarm method set in advance by the user. The alarm information can be obtained in time, and the typos can be modified, which improves the user's work efficiency.

在本发明实施例中,当错别字的错误等级低于预设等级,则基于所述错别字生成错别字报表。需要说明的是,错别字的错误等级与错别字相匹配的关键字字库中的关键字的错误等级相同。In this embodiment of the present invention, when the error level of the typo is lower than a preset level, a typo report is generated based on the typo. It should be noted that the error level of the misspelled word is the same as the error level of the keyword in the keyword font database whose misspelled character matches.

在本发明实施例中,如图2所示,步骤S104还包括如下步骤:In this embodiment of the present invention, as shown in FIG. 2 , step S104 further includes the following steps:

步骤S11,将所述文字与所述关键字字库中的关键字进行匹配,确定出所述文字中的初始错别字,其中,所述关键字字库中的关键字包括:所述云平台预设的关键字,所述用户设定的自定义关键字;Step S11: Match the text with the keywords in the keyword word library, and determine the initial typos in the text, wherein the keywords in the keyword word library include: preset by the cloud platform. keyword, a custom keyword set by the user;

步骤S2,对比所述初始错别字与所述忽略字库中的关键字,判断所述忽略字库中的关键字是否包含所述初始错别字;Step S2, compares the keyword in the described initial typo and the ignored word library, and judges whether the keyword in the ignored word library includes the initial typo;

步骤S13,若判断出所述忽略字库中的关键字不包含所述初始错别字,则确定所述初始错别字为所述错别字。Step S13, if it is determined that the keyword in the ignored word library does not contain the initial typo, then determine that the initial typo is the typo.

在本发明实施例中,建立统一的云平台连接错别字引擎,平台允许客户自定义添加关键字和自定义忽略库配置,同时平台存在默认错别字关键字。针对所属关键字可以进行等级设置,上述的错误等级可以根据用于的实际需求自行设定,一般情况下包括:高危错别字、中危错别字、低危错别字、忽略错别字,按照顺序上述的安全等级所表征的错别字的危险程度依次下降。In the embodiment of the present invention, a unified cloud platform is established to connect the typo engine, the platform allows customers to add keywords and custom ignore library configuration, and the platform has default typo keywords. The level of the keyword can be set. The above error levels can be set according to the actual needs of the application. Generally, they include: high-risk typos, medium-risk typos, low-risk typos, and ignore typos. The above-mentioned security levels are in order. The degree of danger of typos of the representation decreased in order.

云平台根据默认关键字和自定义关键字,通过匹配相关关键字结果进行判断是否存在错别字。The cloud platform judges whether there are typos by matching the results of related keywords according to the default keywords and custom keywords.

然后将错别字结果与自定义忽略库配置结果进行比对,确认本次扫描的最终错别字结果。防止部分已经通用或者混用的词语,用户不认为是错误的错别字进行告警,从而降低了现有技术中对待检测网页的错别字的识别误报率。Then compare the typo result with the custom ignore library configuration result to confirm the final typo result of this scan. It prevents some words that have been used in common or mixed use, and the user does not think that the typo is wrong, thereby reducing the false alarm rate of the typo of the webpage to be detected in the prior art.

另外,还需要说明的是,如果判断出忽略字库中的关键字包含初始错别字,那么确定初始错别字不是所述错别字,无需发出告警信息,从而减少了错别字识别的告警信息,降低了用户的工作量。In addition, it should also be noted that if it is determined that the keyword in the ignored word library contains the initial typo, then it is determined that the initial typo is not the typo, and no alarm information is required, thereby reducing the alarm information for typo recognition and reducing the workload of the user .

实施例二:Embodiment 2:

本发明实施例还提供了一种错别字的监测装置,该错别字的监测装置主要用于执行本发明实施例上述内容所提供的错别字的监测方法,以下对本发明实施例提供的错别字的监测装置做具体介绍。An embodiment of the present invention also provides a device for monitoring typos. The device for monitoring typo is mainly used to execute the method for monitoring typo provided by the above content of the embodiment of the present invention. The following describes the device for monitoring typo provided by the embodiment of the present invention in detail. introduce.

图3是根据本发明实施例的一种错别字的监测装置的示意图,如图3所示,该错别字的监测装置主要包括:获取单元10,第一确定单元20,第二确定单元30和发送单元40。FIG. 3 is a schematic diagram of an apparatus for monitoring typos according to an embodiment of the present invention. As shown in FIG. 3 , the apparatus for monitoring typos mainly includes: an acquisition unit 10 , a first determination unit 20 , a second determination unit 30 and a sending unit 40.

所述获取单元10用于获取待检测网页中的文字;The obtaining unit 10 is used to obtain the text in the webpage to be detected;

所述第一确定单元20用于基于关键字字库和忽略字库,确定所述文字中是否存在错别字;The first determining unit 20 is used to determine whether there is a typo in the text based on the keyword font library and the ignore font library;

所述第二确定单元30用于基于所述关键字字库中的关键字的错误等级,确定出所述错别字的错误等级,其中,所述错误等级用于表征所述错别字的危险程度;The second determining unit 30 is configured to determine the error level of the typo based on the error level of the keywords in the keyword word library, wherein the error level is used to represent the degree of danger of the typo;

所述发送单元40用于若所述错别字的错误等级高于或等于预设等级,则将所述错别字发送给服务器,以使所述服务器基于所述错别字向用户发送告警信息。The sending unit 40 is configured to send the typo to the server if the error level of the typo is higher than or equal to a preset level, so that the server sends alarm information to the user based on the typo.

在本发明实施例中,通过获取待检测网页中的文字;基于关键字字库和忽略字库,确定文字中是否存在错别字;基于关键字字库中的关键字的错误等级,确定出错别字的错误等级,其中,错误等级用于表征错别字的危险程度;若错别字的错误等级高于或等于预设等级,则将错别字发送给服务器,以使服务器基于错别字向用户发送告警信息,达到了降低了待检测网页中的错别字的识别误报率,以及对待检测网页中的错别字进行告警的目的,进而解决了现有技术中错别字的识别误报率较高的技术问题,从而实现了降低了待检测网页中的错别字的识别误报率的技术效果。In the embodiment of the present invention, by acquiring the text in the webpage to be detected; determining whether there are typos in the text based on the keyword font library and the ignore font library; Among them, the error level is used to represent the danger level of the typo; if the error level of the typo is higher than or equal to the preset level, the typo will be sent to the server, so that the server will send an alarm message to the user based on the typo, which reduces the number of pages to be detected. The recognition false alarm rate of the typos in the web page, and the purpose of alarming the typos in the web page to be detected, further solves the technical problem of the high recognition false alarm rate of the typos in the prior art, thereby reducing the number of typos in the web page to be detected. The technical effect of typo recognition false positive rate.

可选地,所述第一确定单元还用于:将所述文字与所述关键字字库中的关键字进行匹配,确定出所述文字中的初始错别字,其中,所述关键字字库中的关键字包括:所述云平台预设的关键字,所述用户设定的自定义关键字;对比所述初始错别字与所述忽略字库中的关键字,判断所述忽略字库中的关键字是否包含所述初始错别字;若判断出所述忽略字库中的关键字不包含所述初始错别字,则确定所述初始错别字为所述错别字。Optionally, the first determining unit is further configured to: match the text with the keywords in the keyword font library, and determine the initial typo in the text, wherein the keywords in the keyword font library The keywords include: keywords preset by the cloud platform, custom keywords set by the user; comparing the initial typos with the keywords in the ignored word library to determine whether the keywords in the ignored word library are The initial typo is included; if it is determined that the keyword in the ignored word library does not contain the initial typo, the initial typo is determined to be the typo.

可选地,所述第一确定单元还用于:若判断出所述忽略字库中的关键字包含所述初始错别字,则确定所述初始错别字不是所述错别字。Optionally, the first determining unit is further configured to: determine that the initial typo is not the typo if it is determined that the keyword in the ignored word library contains the initial typo.

可选地,所述第一确定单元还用于:若判断出所述忽略字库中的关键字不包含所述初始错别字,则确定所述初始错别字不是所述错别字。Optionally, the first determining unit is further configured to: determine that the initial typo is not the typo if it is determined that the keyword in the ignored word library does not contain the initial typo.

可选地,所述装置还包括执行单元用于:若所述错别字的错误等级低于所述预设等级,则基于所述错别字生成错别字报表。Optionally, the apparatus further includes an execution unit configured to: if the error level of the typo is lower than the preset level, generate a typo report based on the typo.

可选地,第二确定单元还用于:将与所述错别字相匹配的关键字字库中的关键字的错误等级确定为所述错别字的错误等级。Optionally, the second determining unit is further configured to: determine the error level of the keyword in the keyword word library matching the typo as the error level of the typo.

另外,在本发明实施例的描述中,除非另有明确的规定和限定,术语“安装”、“相连”、“连接”应做广义理解,例如,可以是固定连接,也可以是可拆卸连接,或一体地连接;可以是机械连接,也可以是电连接;可以是直接相连,也可以通过中间媒介间接相连,可以是两个元件内部的连通。对于本领域的普通技术人员而言,可以具体情况理解上述术语在本发明中的具体含义。In addition, in the description of the embodiments of the present invention, unless otherwise expressly specified and limited, the terms "installed", "connected" and "connected" should be understood in a broad sense, for example, it may be a fixed connection or a detachable connection , or integrally connected; it can be a mechanical connection or an electrical connection; it can be a direct connection, or an indirect connection through an intermediate medium, or the internal communication between the two components. For those of ordinary skill in the art, the specific meanings of the above terms in the present invention can be understood in specific situations.

本申请还提供了一种电子设备,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现上述错别字的监测方法中任一项所述的方法的步骤。The present application also provides an electronic device, including a memory, a processor, and a computer program stored on the memory and running on the processor, and the processor implements the above-mentioned typo detection when the computer program is executed. The steps of any one of the methods.

本申请还提供了一种具有处理器可执行的非易失的程序代码的计算机可读介质,所述程序代码使所述处理器执行上述方法实施例中任一所述方法。The present application also provides a computer-readable medium having a processor-executable non-volatile program code, the program code causing the processor to execute any one of the above method embodiments.

参见图4,本发明实施例还提供一种服务器100,包括:处理器50,存储器51,总线52和通信接口53,所述处理器50、通信接口53和存储器51通过总线52连接;处理器50用于执行存储器51中存储的可执行模块,例如计算机程序。4, an embodiment of the present invention further provides a server 100, including: a processor 50, a memory 51, a bus 52 and a communication interface 53, the processor 50, the communication interface 53 and the memory 51 are connected through the bus 52; the processor 50 is used to execute executable modules, such as computer programs, stored in memory 51 .

其中,存储器51可能包含高速随机存取存储器(RAM,Random Access Memory),也可能还包括非不稳定的存储器(non-volatile memory),例如至少一个磁盘存储器。通过至少一个通信接口53(可以是有线或者无线)实现该系统网元与至少一个其他网元之间的通信连接,可以使用互联网,广域网,本地网,城域网等。The memory 51 may include a high-speed random access memory (RAM, Random Access Memory), and may also include a non-volatile memory (non-volatile memory), such as at least one disk memory. The communication connection between the system network element and at least one other network element is realized through at least one communication interface 53 (which may be wired or wireless), and the Internet, wide area network, local area network, metropolitan area network, etc. may be used.

总线52可以是ISA总线、PCI总线或EISA总线等。所述总线可以分为地址总线、数据总线、控制总线等。为便于表示,图4中仅用一个双向箭头表示,但并不表示仅有一根总线或一种类型的总线。The bus 52 may be an ISA bus, a PCI bus, an EISA bus, or the like. The bus can be divided into an address bus, a data bus, a control bus, and the like. For ease of presentation, only one bidirectional arrow is used in FIG. 4, but it does not mean that there is only one bus or one type of bus.

其中,存储器51用于存储程序,所述处理器50在接收到执行指令后,执行所述程序,前述本发明实施例任一实施例揭示的流过程定义的装置所执行的方法可以应用于处理器50中,或者由处理器50实现。The memory 51 is used to store a program, and the processor 50 executes the program after receiving the execution instruction, and the method executed by the device defined by the stream process disclosed in any of the foregoing embodiments of the present invention can be applied to processing in the processor 50 , or implemented by the processor 50 .

处理器50可能是一种集成电路芯片,具有信号的处理能力。在实现过程中,上述方法的各步骤可以通过处理器50中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器50可以是通用处理器,包括中央处理器(Central Processing Unit,简称CPU)、网络处理器(Network Processor,简称NP)等;还可以是数字信号处理器(Digital SignalProcessing,简称DSP)、专用集成电路(Application Specific Integrated Circuit,简称ASIC)、现成可编程门阵列(Field-Programmable Gate Array,简称FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。可以实现或者执行本发明实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合本发明实施例所公开的方法的步骤可以直接体现为硬件译码处理器执行完成,或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器,闪存、只读存储器,可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器51,处理器50读取存储器51中的信息,结合其硬件完成上述方法的步骤。The processor 50 may be an integrated circuit chip with signal processing capability. In the implementation process, each step of the above-mentioned method may be completed by a hardware integrated logic circuit in the processor 50 or an instruction in the form of software. The above-mentioned processor 50 may be a general-purpose processor, including a central processing unit (CPU for short), a network processor (NP for short), etc.; it may also be a digital signal processor (Digital Signal Processing, DSP for short) , Application Specific Integrated Circuit (ASIC for short), Field-Programmable Gate Array (FPGA for short) or other programmable logic devices, discrete gate or transistor logic devices, and discrete hardware components. Various methods, steps, and logical block diagrams disclosed in the embodiments of the present invention can be implemented or executed. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The steps of the method disclosed in conjunction with the embodiments of the present invention may be directly embodied as executed by a hardware decoding processor, or executed by a combination of hardware and software modules in the decoding processor. The software modules may be located in random access memory, flash memory, read-only memory, programmable read-only memory or electrically erasable programmable memory, registers and other storage media mature in the art. The storage medium is located in the memory 51, and the processor 50 reads the information in the memory 51 and completes the steps of the above method in combination with its hardware.

在本发明的描述中,需要说明的是,术语“中心”、“上”、“下”、“左”、“右”、“竖直”、“水平”、“内”、“外”等指示的方位或位置关系为基于附图所示的方位或位置关系,仅是为了便于描述本发明和简化描述,而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作,因此不能理解为对本发明的限制。此外,术语“第一”、“第二”、“第三”仅用于描述目的,而不能理解为指示或暗示相对重要性。In the description of the present invention, it should be noted that the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer", etc. The indicated orientation or positional relationship is based on the orientation or positional relationship shown in the accompanying drawings, which is only for the convenience of describing the present invention and simplifying the description, rather than indicating or implying that the indicated device or element must have a specific orientation or a specific orientation. construction and operation, and therefore should not be construed as limiting the invention. Furthermore, the terms "first", "second", and "third" are used for descriptive purposes only and should not be construed to indicate or imply relative importance.

在本申请所提供的几个实施例中,应该理解到,所揭露的系统、装置和方法,可以通过其它的方式实现。以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,又例如,多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些通信接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. The apparatus embodiments described above are only illustrative. For example, the division of the units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored, or not implemented. On the other hand, the shown or discussed mutual coupling or direct coupling or communication connection may be through some communication interfaces, indirect coupling or communication connection of devices or units, which may be in electrical, mechanical or other forms.

所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.

另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.

最后应说明的是:以上所述实施例,仅为本发明的具体实施方式,用以说明本发明的技术方案,而非对其限制,本发明的保护范围并不局限于此,尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,其依然可以对前述实施例所记载的技术方案进行修改或可轻易想到变化,或者对其中部分技术特征进行等同替换;而这些修改、变化或者替换,并不使相应技术方案的本质脱离本发明实施例技术方案的精神和范围,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应所述以权利要求的保护范围为准。Finally, it should be noted that the above-mentioned embodiments are only specific implementations of the present invention, and are used to illustrate the technical solutions of the present invention, but not to limit them. The protection scope of the present invention is not limited thereto, although referring to the foregoing The embodiment has been described in detail the present invention, those of ordinary skill in the art should understand: any person skilled in the art who is familiar with the technical field within the technical scope disclosed by the present invention can still modify the technical solutions described in the foregoing embodiments. Or can easily think of changes, or equivalently replace some of the technical features; and these modifications, changes or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the embodiments of the present invention, and should be covered in the present invention. within the scope of protection. Therefore, the protection scope of the present invention should be based on the protection scope of the claims.

Claims (10)

1.一种错别字的监测方法,其特征在于,应用于云平台,所述方法包括:1. a monitoring method for typos, is characterized in that, is applied to cloud platform, and described method comprises: 获取待检测网页中的文字;Get the text in the webpage to be detected; 基于关键字字库和忽略字库,确定所述文字中是否存在错别字;Determine whether there is a typo in the text based on the keyword font library and the ignore font library; 基于所述关键字字库中的关键字的错误等级,确定出所述错别字的错误等级;Determine the error level of the misspelled word based on the error level of the keywords in the keyword word library; 若所述错别字的错误等级高于或等于预设等级,则将所述错别字发送给服务器,以使所述服务器基于所述错别字向用户发送告警信息。If the error level of the typo is higher than or equal to a preset level, the typo is sent to the server, so that the server sends warning information to the user based on the typo. 2.根据权利要求1所述的方法,其特征在于,基于关键字字库和忽略字库,确定所述文字中是否存在错别字,包括:2. method according to claim 1, it is characterised in that, based on keyword word bank and ignore word bank, determine whether there is a typo in the text, comprising: 将所述文字与所述关键字字库中的关键字进行匹配,确定出所述文字中的初始错别字,其中,所述关键字字库中的关键字包括:所述云平台预设的关键字,所述用户设定的自定义关键字;Matching the text with the keywords in the keyword font library to determine the initial typo in the text, wherein the keywords in the keyword font library include: keywords preset by the cloud platform, the custom keyword set by the user; 对比所述初始错别字与所述忽略字库中的关键字,判断所述忽略字库中的关键字是否包含所述初始错别字;Comparing the initial typo with the keyword in the ignored word library, it is judged whether the keyword in the ignored word library contains the initial typo; 若判断出所述忽略字库中的关键字不包含所述初始错别字,则确定所述初始错别字为所述错别字。If it is determined that the keyword in the ignored word library does not contain the initial typo, then the initial typo is determined to be the typo. 3.根据权利要求2所述的方法,其特征在于,所述方法还包括:3. The method according to claim 2, wherein the method further comprises: 若判断出所述忽略字库中的关键字包含所述初始错别字,则确定所述初始错别字不是所述错别字。If it is determined that the keyword in the ignored word library contains the initial typo, it is determined that the initial typo is not the typo. 4.根据权利要求1所述的方法,其特征在于,所述方法还包括:4. The method according to claim 1, wherein the method further comprises: 若所述错别字的错误等级低于所述预设等级,则基于所述错别字生成错别字报表。If the error level of the typo is lower than the preset level, a typo report is generated based on the typo. 5.根据权利要求1所述的方法,其特征在于,基于所述关键字字库中的关键字的错误等级,确定出所述错别字的错误等级,包括:5. The method according to claim 1, wherein, determining the error level of the misspelled word based on the error level of the keywords in the keyword word library, comprising: 将与所述错别字相匹配的关键字字库中的关键字的错误等级确定为所述错别字的错误等级,其中,所述错误等级用于表征所述错别字的危险程度。The error level of the keyword in the keyword word database matching the typo is determined as the error level of the typo, wherein the error level is used to represent the degree of danger of the typo. 6.一种错别字的监测装置,其特征在于,应用于云平台,包括:获取单元,第一确定单元,第二确定单元和发送单元,其中,6. A monitoring device for typos, characterized in that, applied to a cloud platform, comprising: an acquiring unit, a first determining unit, a second determining unit and a sending unit, wherein, 所述获取单元用于获取待检测网页中的文字;The obtaining unit is used to obtain the text in the webpage to be detected; 所述第一确定单元用于基于关键字字库和忽略字库,确定所述文字中是否存在错别字;The first determining unit is used to determine whether there is a typo in the text based on the keyword font library and the ignore font library; 所述第二确定单元用于基于所述关键字字库中的关键字的错误等级,确定出所述错别字的错误等级;The second determining unit is configured to determine the error level of the misspelled word based on the error level of the keywords in the keyword word library; 所述发送单元用于若所述错别字的错误等级高于或等于预设等级,则将所述错别字发送给服务器,以使所述服务器基于所述错别字向用户发送告警信息。The sending unit is configured to send the typo to the server if the error level of the typo is higher than or equal to a preset level, so that the server sends alarm information to the user based on the typo. 7.根据权利要求6所述的装置,其特征在于,所述第一确定单元还用于:7. The apparatus according to claim 6, wherein the first determining unit is further configured to: 将所述文字与所述关键字字库中的关键字进行匹配,确定出所述文字中的初始错别字,其中,所述关键字字库中的关键字包括:所述云平台预设的关键字,所述用户设定的自定义关键字;Matching the text with the keywords in the keyword font library to determine the initial typo in the text, wherein the keywords in the keyword font library include: keywords preset by the cloud platform, the custom keyword set by the user; 对比所述初始错别字与所述忽略字库中的关键字,判断所述忽略字库中的关键字是否包含所述初始错别字;Comparing the initial typo with the keyword in the ignored word library, it is judged whether the keyword in the ignored word library contains the initial typo; 若判断出所述忽略字库中的关键字不包含所述初始错别字,则确定所述初始错别字为所述错别字。If it is determined that the keyword in the ignored word library does not contain the initial typo, then the initial typo is determined to be the typo. 8.根据权利要求7所述的装置,其特征在于,所述第一确定单元还用于:8. The apparatus according to claim 7, wherein the first determining unit is further configured to: 若判断出所述忽略字库中的关键字包含所述初始错别字,则确定所述初始错别字不是所述错别字。If it is determined that the keyword in the ignored word library contains the initial typo, it is determined that the initial typo is not the typo. 9.一种电子设备,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,其特征在于,所述处理器执行所述计算机程序时实现上述权利要求1至5中任一项所述的方法的步骤。9. An electronic device comprising a memory, a processor and a computer program stored on the memory and running on the processor, wherein the processor implements the above claims when executing the computer program Steps of the method of any one of 1 to 5. 10.一种具有处理器可执行的非易失的程序代码的计算机可读介质,其特征在于,所述程序代码使所述处理器执行上述权利要求1至5中任一项所述方法。10. A computer-readable medium having non-volatile program code executable by a processor, wherein the program code causes the processor to perform the method of any one of the preceding claims 1 to 5.
CN201911097666.2A 2019-11-11 2019-11-11 Typo-monitoring method, device, electronic device and computer-readable medium Active CN110852091B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911097666.2A CN110852091B (en) 2019-11-11 2019-11-11 Typo-monitoring method, device, electronic device and computer-readable medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911097666.2A CN110852091B (en) 2019-11-11 2019-11-11 Typo-monitoring method, device, electronic device and computer-readable medium

Publications (2)

Publication Number Publication Date
CN110852091A true CN110852091A (en) 2020-02-28
CN110852091B CN110852091B (en) 2023-08-15

Family

ID=69601366

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911097666.2A Active CN110852091B (en) 2019-11-11 2019-11-11 Typo-monitoring method, device, electronic device and computer-readable medium

Country Status (1)

Country Link
CN (1) CN110852091B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115065671A (en) * 2022-03-04 2022-09-16 山谷网安科技股份有限公司 Method and system for realizing dynamically expandable wrong word detection service
CN115186657A (en) * 2022-07-28 2022-10-14 北京网景盛世技术开发中心 Error sensitive information detection method, device, computer equipment and storage medium

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08221416A (en) * 1995-02-10 1996-08-30 Matsushita Electric Ind Co Ltd Error checking device
US20040107089A1 (en) * 1998-01-27 2004-06-03 Gross John N. Email text checker system and method
US20080178076A1 (en) * 2007-01-18 2008-07-24 Barry Alan Kritt Method and apparatus for spellchecking electronic documents
CN201732368U (en) * 2010-08-16 2011-02-02 连美玲 Wrongly written word detector
CN102033915A (en) * 2010-12-03 2011-04-27 百度在线网络技术(北京)有限公司 Open-type knowledge sharing platform and editing prompt method thereof
CN105159871A (en) * 2015-08-21 2015-12-16 小米科技有限责任公司 Text information detection method and apparatus
CN106209863A (en) * 2016-07-15 2016-12-07 河南山谷网安科技股份有限公司 A kind of web portal security monitoring method based on the scanning of full station
CN106649325A (en) * 2015-10-29 2017-05-10 北京国双科技有限公司 Recognition method and device for wrongly-written characters in website
CN107203510A (en) * 2017-05-23 2017-09-26 深圳天珑无线科技有限公司 character detecting method and device
CN107679036A (en) * 2017-10-12 2018-02-09 南京网数信息科技有限公司 A kind of wrong word monitoring method and system
CN108090043A (en) * 2017-11-30 2018-05-29 北京百度网讯科技有限公司 Error correction report processing method, device and readable medium based on artificial intelligence
WO2019200699A1 (en) * 2018-04-19 2019-10-24 平安科技(深圳)有限公司 Document issuance method and apparatus for government system, computer device and storage medium

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08221416A (en) * 1995-02-10 1996-08-30 Matsushita Electric Ind Co Ltd Error checking device
US20040107089A1 (en) * 1998-01-27 2004-06-03 Gross John N. Email text checker system and method
US20080178076A1 (en) * 2007-01-18 2008-07-24 Barry Alan Kritt Method and apparatus for spellchecking electronic documents
CN201732368U (en) * 2010-08-16 2011-02-02 连美玲 Wrongly written word detector
CN102033915A (en) * 2010-12-03 2011-04-27 百度在线网络技术(北京)有限公司 Open-type knowledge sharing platform and editing prompt method thereof
CN105159871A (en) * 2015-08-21 2015-12-16 小米科技有限责任公司 Text information detection method and apparatus
CN106649325A (en) * 2015-10-29 2017-05-10 北京国双科技有限公司 Recognition method and device for wrongly-written characters in website
CN106209863A (en) * 2016-07-15 2016-12-07 河南山谷网安科技股份有限公司 A kind of web portal security monitoring method based on the scanning of full station
CN107203510A (en) * 2017-05-23 2017-09-26 深圳天珑无线科技有限公司 character detecting method and device
CN107679036A (en) * 2017-10-12 2018-02-09 南京网数信息科技有限公司 A kind of wrong word monitoring method and system
CN108090043A (en) * 2017-11-30 2018-05-29 北京百度网讯科技有限公司 Error correction report processing method, device and readable medium based on artificial intelligence
WO2019200699A1 (en) * 2018-04-19 2019-10-24 平安科技(深圳)有限公司 Document issuance method and apparatus for government system, computer device and storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
刘希榕: "基于用户体验的政府网站优化关键技术研究", 《福建电脑》 *
徐梦瑶: "网商用户评论中错别字自动检测与纠正的研究及实现", 《中国优秀硕士学位论文电子期刊》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115065671A (en) * 2022-03-04 2022-09-16 山谷网安科技股份有限公司 Method and system for realizing dynamically expandable wrong word detection service
CN115065671B (en) * 2022-03-04 2024-04-02 山谷网安科技股份有限公司 Method and system for realizing dynamically-extensible word-dislocation detection service
CN115186657A (en) * 2022-07-28 2022-10-14 北京网景盛世技术开发中心 Error sensitive information detection method, device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN110852091B (en) 2023-08-15

Similar Documents

Publication Publication Date Title
CN110275958B (en) Website information identification method and device and electronic equipment
CN108932426B (en) Unauthorized vulnerability detection method and device
CN111813960B (en) Knowledge graph-based data security audit model device, method and terminal equipment
US7966553B2 (en) Accessible content reputation lookup
US10944749B1 (en) Data scrubbing via template generation and matching
CN107895122B (en) Special sensitive information active defense method, device and system
CN110457195A (en) Method, device, server and storage medium for obtaining local log of client
CN108366052B (en) Processing method and system for verification short message
CN111711617A (en) Method and device for detecting web crawler, electronic equipment and storage medium
CN107346388A (en) Web attack detection methods and device
CN112016078B (en) A method, device, server and storage medium for detecting a blocking of a login device
CN112507167A (en) Method and device for identifying video collection, electronic equipment and storage medium
CN110868419A (en) Method and device for detecting WEB backdoor attack event and electronic equipment
CN110852091A (en) Method and device for monitoring wrongly written characters, electronic equipment and computer readable medium
CN110602030A (en) Network intrusion blocking method, server and computer readable medium
WO2016188334A1 (en) Method and device for processing application access data
CN108804501B (en) A method and device for detecting valid information
CN110535866A (en) Generation method, device and the server of system portrait
CN102077510A (en) Targeted user notification of messages in a monitoring system
CN110955890B (en) Method and device for detecting malicious batch access behaviors and computer storage medium
US11075867B2 (en) Method and system for detection of potential spam activity during account registration
CN113064834B (en) Abnormality detection method, abnormality detection device, electronic apparatus, and medium
CN114564947A (en) Rail traffic signal fault operation and maintenance method, device and electronic equipment
CN110381017A (en) A kind of illegal request recognition methods and device
CN114969450A (en) User behavior analysis method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
EE01 Entry into force of recordation of patent licensing contract
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20200228

Assignee: Hangzhou Anheng Information Security Technology Co.,Ltd.

Assignor: Dbappsecurity Co.,Ltd.

Contract record no.: X2024980043363

Denomination of invention: Monitoring methods, devices, electronic devices, and computer-readable media for spelling errors

Granted publication date: 20230815

License type: Common License

Record date: 20241231