



























業(yè)務(wù)咨詢(xún)
400-893-8989
行政總機(jī)
010-64303888
微信掃描二維碼 立即在線(xiàn)咨詢(xún)
一種文本標(biāo)注方法和裝置、計(jì)算機(jī)可讀存儲(chǔ)介質(zhì)
2022-05-13

本申請(qǐng)公開(kāi)了一種文本標(biāo)注方法和裝置、計(jì)算機(jī)可讀存儲(chǔ)介質(zhì),所述方法包括:監(jiān)聽(tīng)并接收用戶(hù)的鼠標(biāo)操作;當(dāng)檢測(cè)到鼠標(biāo)選中一個(gè)實(shí)體并連續(xù)拖動(dòng)時(shí),高亮顯示鼠標(biāo)拖動(dòng)過(guò)程中所有經(jīng)過(guò)的區(qū)域,所述所有經(jīng)過(guò)的區(qū)域包括完整字符和/或非完整字符。本申請(qǐng)通過(guò)高亮顯示鼠標(biāo)拖動(dòng)過(guò)程中所有經(jīng)過(guò)的區(qū)域,既能使得鼠標(biāo)當(dāng)前坐標(biāo)給予用戶(hù)連續(xù)的位置反饋,同時(shí)又能較精準(zhǔn)地反饋出用戶(hù)希望標(biāo)注的文字的范圍,大大提升了在進(jìn)行大量數(shù)據(jù)標(biāo)注過(guò)程中的用戶(hù)的標(biāo)注體驗(yàn),提高了文本標(biāo)注的效率。
一種文本標(biāo)注方法,其特征在于,包括:監(jiān)聽(tīng)并接收用戶(hù)的鼠標(biāo)操作;當(dāng)檢測(cè)到鼠標(biāo)選中一個(gè)實(shí)體并連續(xù)拖動(dòng)時(shí),高亮顯示鼠標(biāo)拖動(dòng)過(guò)程中所有經(jīng)過(guò)的區(qū)域,所述所有經(jīng)過(guò)的區(qū)域包括完整字符和/或非完整字符。

申請(qǐng)?zhí)枺篊N201810717682.6
申請(qǐng)(專(zhuān)利權(quán))人:北京明略軟件系統(tǒng)有限公司
公開(kāi)日期(公開(kāi)):2018.12.21
公開(kāi)日期(授權(quán)):2022.05.13
推薦閱讀
全球雙榜SOTA!明略科技專(zhuān)有大模型 Mano開(kāi)啟GUI智能操作新時(shí)代
2025-09-28
明略科技推出的GUI大模型Mano在Mind2Web和OSWorld兩大基準(zhǔn)測(cè)試中取得了創(chuàng)紀(jì)錄的SOTA成績(jī),成功率達(dá)到40.1%。通過(guò)在線(xiàn)強(qiáng)化學(xué)習(xí)和自動(dòng)數(shù)據(jù)采集,Mano為GUI智能體領(lǐng)域提供了可擴(kuò)展的新范式,顯著提升了復(fù)雜任務(wù)的執(zhí)行能力。該技術(shù)的突破不僅推動(dòng)了自動(dòng)化邊界的擴(kuò)展,也為企業(yè)智能化轉(zhuǎn)型提供了強(qiáng)有力的支持
了解更多
官宣!明略科技推出專(zhuān)有大模型產(chǎn)品線(xiàn)DeepMiner,可信商業(yè)數(shù)據(jù)分析智能體終于能用了!
2025-09-22
明略科技推出專(zhuān)有大模型產(chǎn)品線(xiàn)DeepMiner,以“可信智能體+可信數(shù)據(jù)”雙輪驅(qū)動(dòng),解決企業(yè)智能體落地中幻覺(jué)率高、過(guò)程不透明等痛點(diǎn)。該產(chǎn)品通過(guò)多智能體協(xié)同架構(gòu)、企業(yè)級(jí)數(shù)據(jù)整合及全流程透明化設(shè)計(jì),降低幻覺(jué)率并支持知識(shí)沉淀,其自研Mano和Cito模型分別實(shí)現(xiàn)精準(zhǔn)執(zhí)行與深度推理,助力企業(yè)構(gòu)建可信生產(chǎn)力。
了解更多
明略科技 Mano Technical Report
2025-09-18
Graphical user interfaces (GUIs) are the primary medium for human-computer interaction, yet automating GUI interactions remains challenging due to the complexity of visual elements, dynamic environments, and the need for multi-step reasoning. Existing methods based on vision-language models (VLMs) often suffer from limited resolution, domain mismatch, and insufficient sequential decisionmaking capability. To address these issues, we propose Mano, a robust GUI agent built upon a multi-modal foundation model pre-trained on extensive web and computer system data. Our approach integrates a novel simulated environment for high-fidelity data generation, a three-stage training pipeline (supervised fine-tuning, offline reinforcement learning, and online reinforcement learning), and a verification module for error recovery. Mano demonstrates state-of-the-art performance on multiple GUI benchmarks, including Mind2Web and OSWorld, achieving significant improvements in success rate and operational accuracy. Our work provides new insights into the effective integration of reinforcement learning with VLMs for practical GUI agent deployment, highlighting the importance of domain-specific data, iterative training, and holistic reward design.
了解更多聯(lián)系我們
關(guān)注明略
在線(xiàn)咨詢(xún)
銷(xiāo)售熱線(xiàn)
400-893-8989
投資者關(guān)系
ir@mininglamp.com
媒體聯(lián)絡(luò)
pr@miningalmp.com
Copyright@2025 北京明略軟件系統(tǒng)有限公司 京ICP備15016868號(hào) 京公網(wǎng)安備11010802024262
信息填寫(xiě)
