《大數據專業英語教程(第1版)》參考試卷_第1頁
《大數據專業英語教程(第1版)》參考試卷_第2頁
《大數據專業英語教程(第1版)》參考試卷_第3頁
《大數據專業英語教程(第1版)》參考試卷_第4頁
《大數據專業英語教程(第1版)》參考試卷_第5頁
已閱讀5頁,還剩6頁未讀 繼續免費閱讀

下載本文檔

版權說明:本文檔由用戶提供并上傳,收益歸屬內容提供方,若內容存在侵權,請進行舉報或認領

文檔簡介

《大數據專業英語教程》(機械工業出版社)參考試卷命題人:張強華司愛俠參考試卷一、寫出以下單詞的中文意思(每小題0.5分,共10分)1accumulate11authentication2operation12malware3complexity13ransomware4filtering14vulnerability5leakage15process6engine16validity7recovery17interpretation8storage18classification9ensure19element10accumulate20executable二、根據給出的中文意思,寫出英文單詞(每小題0.5分,共10分)1n.元數據11n.并發(性)2n.特性;屬性12n.數據庫3n.服務器13adj.程序的,過程的4n.推薦引擎;推薦系統14n.倉庫;貯藏室5n.標準,規格15vt.收集;采集6adj.定性的16n.聚集;集成;集結7n.登記,注冊17adj.跨平臺的8n.備份18vt.取得,獲得;實現9n.容量;性能19n.體系結構;(總體、層次)結構10n.冗余;過多,過剩20v.擔保;確保n.保證;保修單1dataflow2datamart3datamining4datasharing5datadefinition6datastorage7datavisualization8operatingsystem9semi-structureddata10sampledata四、根據給出的中文意思,寫出英文短語(每小題1分,共10分)1非結構化數據2層次數據模型,分級數據模型3文本分析4數據點5數據收集6自治數據庫7數據倉庫8混合云9機器學習10非關系數據庫五、寫出以下縮略語的完整形式和中文意思(每小題1分,共10分)縮略語完整形式中文意思1AI2BDF3CMS4API5DDL6DML7DQL8ELT9JVM10SLA六、閱讀短文,回答問題(每小題2分,共10分)TheImportanceofClusteringandClassificationinDataScienceThepurposeofclusteringandclassificationalgorithmsistomakesenseofandextractvaluefromlargesetsofstructuredandunstructureddata.Ifyou’reworkingwithhugevolumesofunstructureddata,itonlymakessensetotrytopartitionthedataintosomesortoflogicalgroupingsbeforeattemptingtoanalyzeit.Clusteringandclassificationallowsyoutotakeasweepingglanceofyourdataenmasse,andthenformsomelogicalstructuresbasedonwhatyoufindtherebeforegoingdeeperintothenuts-and-boltsanalysis.Intheirsimplestform,clustersaresetsofdatapointsthatsharesimilarattributes,andclusteringalgorithmsarethemethodsthatgroupthesedatapointsintodifferentclustersbasedontheirsimilarities.You’llseeclusteringalgorithmsusedfordiseaseclassificationinmedicalscience,butyou’llalsoseethemusedforcustomerclassificationinmarketingresearchandforenvironmentalhealthriskassessmentinenvironmentalengineering.Therearedifferentclusteringmethods,dependingonhowyouwantyourdatasettobedivided.Thetwomaintypesofclusteringalgorithmsare:Hierarchical:Algorithmscreateseparatesetsofnestedclusters,eachintheirownhierarchallevel.Partitional:Algorithmscreatejustasinglesetofclusters.Youmighthaveheardofclassificationandthoughtthatclassificationisthesamethingasclustering.Manypeopledo,butthisisnotthecase.Inclassification,beforeyoustart,youalreadyknowthenumberofclassesintowhichyourdatashouldbegroupedandyoualreadyknowwhatclassyouwanteachdatapointtobeassigned.Inclassification,thedatainthedatasetbeinglearnedfromislabeled.Whenyouuseclusteringalgorithms,ontheotherhand,youhavenopredefinedconceptforhowmanyclustersareappropriateforyourdata,andyourelyupontheclusteringalgorithmstosortandclusterthedatainthemostappropriateway.Withclusteringtechniques,you’relearningfromunlabeleddata.Tobetterillustratethenatureofclassification,though,takealookatTwitteranditshash-taggingsystem.Sayyoujustgotholdofyourfavoritedrinkintheentireworld:anicedcaramellattefromStarbucks.You’resohappytohaveyourdrinkthatyoudecidetotweetaboutitwithaphotoandthephrase“ThisisthebestlatteEVER!#StarbucksRocks.”Well,ofcourse,youinclude“#StarbucksRocks”inyourtweetsothatthetweetgoesintothe#StarbucksRocksstreamandisclassifiedtogetherwithalltheothertweetsthathavebeenlabeledas#StarbucksRocks.YouruseofthehashtaglabelinyourtweettoldTwitterhowtoclassifyyourdataintoarecognizableandaccessiblegroup,orcluster.Whatisthepurposeofclusteringandclassificationalgorithms?Whatareclustersintheirsimplestform?Whatareclusteringalgorithms?3.Howmanymaintypesofclusteringalgorithmsarethere?Whatarethey?4.Whatdoyoualreadyknowinclaasification?5.Howcanyoubetterillustratethenatureofclassification?將下列詞填入適當的位置(每詞只用一次)。(每小題10分,共20分)填空題1供選擇的答案:uniquehierarchicalprocessesincludeacceptableinvolvesaccuracyhackerslinkedissuesTypesofDataIntegrityTherearetwotypesofdataintegrity:physicalintegrityandlogicalintegrity.Bothareacollectionofprocessesandmethodsthatenforcedataintegrityinboth___1___andrelationaldatabases.PhysicalintegrityPhysicalintegrityistheprotectionofdata’swholenessand___2___asit’sstoredandretrieved.Whennaturaldisastersstrike,powergoesout,orhackersdisruptdatabasefunctions,physicalintegrityiscompromised.Humanerror,storageerosion,andahostofother___3___canalsomakeitimpossiblefordataprocessingmanagers,systemprogrammers,applicationsprogrammers,andinternalauditorstoobtainaccuratedata.LogicalintegrityLogicalintegritykeepsdataunchangedasit’susedindifferentwaysinarelationaldatabase.Logicalintegrityprotectsdatafromhumanerrorand___4___aswell,butinamuchdifferentwaythanphysicalintegritydoes.Therearefourtypesoflogicalintegrity.2.1EntityintegrityEntityintegrityreliesonthecreationofprimarykeys,or___5___valuesthatidentifypiecesofdata,toensurethatdataisn’tlistedmorethanonceandthatnofieldinatableisnull.It’safeatureofrelationalsystemswhichstoredataintablesthatcanbe___6___andusedinavarietyofways.2.2ReferentialintegrityReferentialintegrityreferstotheseriesof___7___thatmakesuredataisstoredanduseduniformly.Rulesembeddedintothedatabase’sstructureabouthowforeignkeysareusedensurethatonlyappropriatechanges,additions,ordeletionsofdataoccur.Rulesmay___8___constraintsthateliminatetheentryofduplicatedata,guaranteethatdataisaccurate,and/ordisallowtheentryofdatathatdoesn’tapply.2.3DomainintegrityDomainintegrityisthecollectionofprocessesthatensuretheaccuracyofeachpieceofdatainadomain.Inthiscontext,adomainisasetof___9___valuesthatacolumnisallowedtocontain.Itcanincludeconstraintsandothermeasuresthatlimittheformat,type,andamountofdataentered.2.4User-definedintegrityUser-definedintegrity___10___therulesandconstraintscreatedbytheusertofittheirparticularneeds.Sometimesentity,referential,anddomainintegrityaren’tenoughtosafeguarddata.Often,specificbusinessrulesmustbetakenintoaccountandincorporatedintodataintegritymeasures.填空題2供選擇的答案:programsarchitecturelayerhandlingcreatecenterinfrastructurenetworksstoragemachinesBigDataCloudReferenceArchitectureThecloudarchitectureforbigdataisefficienttomanagecomplicatedcomputingscalability,storage,andnetworkinginfrastructure.Theinfrastructureasserviceprovidersmainlydealswithservers,___1___,inadditiontostorageapplicationsandoffersfacilitiessuchasvirtualization,basicmonitoringandsafety,operatingsystem,serverinadata___2___,andstorageservices.Thefourlayersofbigdatacloudarchitecturearediscussedbelow:BigDataAnalytics-SoftwareasaService(BDA-SaaS):Theanalyticsofbigdataofferedasservicegivesusersthecapabilitytoquicklyworkonanalyticswithoutspendingon___3___andpayforthefacilitiesused.Thefunctionsofthislayerare:?Arrangementofsoftwareapplicationsrepository?Software___4___deploymentontheinfrastructure?Resultdeliverytotheusers.BigDataAnalytics-PlatformasaService(BPaaS):Thisisthesecondlayerofthe___5___.Itisthecorelayerthatprovidesplatform-relatedservicestoworkwithstoredbigdataandcomputing.Datamanagementtools,schedulers,andprogrammingenvironmentsfordata-intensiveanddataprocessingtasks,whichareconsideredasmiddlewaremanagementtoolsresideinthisregion.This___6___responsiblefordevelopingsoftwaredevelopmentkitsandtoolsnecessaryforanalytics.BigDataFabric(BDF):Thisisthefabriclayerofbigdata,responsibleforaddressingtoolsandAPIsthatsupportthe___7___ofdata,datacomputation,andaccesstodifferentapplicationservices.ThislayercomprisesAPIsandinteroperableprotocoldesignedtoconnectthespecifiedmultiplecloudinfrastructuralstandards.CloudInfrastructure(CI):Thecloudinfrastructureisresponsiblefor___8___theinfrastructurefordatastorageandcomputationasservices.TheservicesofferedbyCIlayerareasfollows:●Tocreatelarge-scaleelasticinfrastructureforbigdatastorage,capableofon-demanddeployment.●Tosetupdynamicvirtual___9___.●Togenerateson-demandstoragefacilitiesthatrelatetobigdatamanagementforfile,block,andobject-based.●Toenableseamlesspassageofdataacrossthestoragerepositories.●To___10___virtualmachinesandtomountthefilesystemwiththecomputenode.短文翻譯(每小題10分,共20分)翻譯題1DataCleaningWhatisdatacleaning?Datacleaningistheprocessoffixingorremovingincorrect,corrupted,incorrectlyformatted,duplicate,orincompletedatawithinadataset.Datacleaning,whichisalsoreferredtoasdatacleansinganddatascrubbing,isoneofthemostimportantstepsforyourorganizationifyouwanttocreateaculturearoundqualitydatadecision-making.Datacleaningisnotsimplyabouterasinginformationtomakespacefornewdata,butratherfindingawaytomaximizeadataset’saccuracywithoutnecessarilydeletinginformation.Datacleaningincludesmoreactionsthanremovingdata,suchasfixingspellingandsyntaxerrors,standardizingdatasets,andcorrectingmistakessuchasemptyfields,missingcodes,andidentifyingduplicatedatapoints.Mostimportantly,thegoalofdatacleaningistocreatedatasetsthatarestandardizedanduniformtoallowbusinessintelligenceanddataanalyticstoolstoeasilyaccessandfindtherightdataforeachquery.Whatisthedifferencebetweendatacleaninganddatatransformation?Datacleaningistheprocessthatremovesdatathatdoesnotbelonginyourdataset.Datatransformationistheprocessofconvertingdatafromoneformatorstructureintoanother.Transformationprocessescanalsobereferredtoasdatawrangling,ordatamunging,transformingandmappingdatafromone"raw"dataformintoanotherformatforwarehousingandanalyzing.BenefitsofdatacleaningHavingcleandatawillultimatelyincreaseoverallproductivityandallowforthehighestqualityinformationinyourdecision-making.Thebenefitsinclude:●Removaloferrorswhenmultiplesourcesofdataareatplay.●Fewererrorsmakeforhappierclientsandless-frustratedemployees.●Abilitytomapthedifferentfunctionsandwhatyourdataisintendedtodo.●Monitoringerrorsandbetterreportingtoseewhereerrorsarecomingfrom,makingiteasiertofixincorrectorcorruptdataforfutureapplications.●Usingtoolsfordatacleaningwillmakeformoreefficientbusinesspracticesandquickerdecision-making.翻譯題2DataVisualization??Datavisualizationisthepracticeoftranslatinginformationintoavisualcontext,suchasamaporgraph,tomakedataeasierforthehumanbraintounderstandandpullinsightsfrom.Themaingoalofdatavisualizationistomakeiteasiertoidentifypatterns,trendsandoutliersinlargedatasets.Thetermisoftenusedinterchangeablywithothers,includinginformationgraphics,informationvisualizationandstatisticalgraphics.Datavisualizationisoneofthestepsofthedatascienceprocess,whichstatesthatafterdatahasbeencollected,processedandmodeled,itmustbevisualizedforconclusionstobemade.Datavisualizationisalsoanelementofthebroaderdatapresentationarchitecture(DPA)discipline,whichaimstoidentify,locate,manipulate,formatanddeliverdatainthemostefficientwaypossible.Datavisualizationisimportantforalmosteverycareer.Itcanbeusedbyteacherstodisplaystudenttestresults,bycomputerscientistsexploringadvancementsinartificialintelligence(AI)orbyexecutiveslookingtoshareinformationwithstakeholders.Italsoplaysanimportantroleinbigdataprojects.Asbusinessesaccumulatedmassivecollectionsofdataduringtheearlyyearsofthebigdatatrend,theyneededawaytoquicklyandeasilygetanoverviewoftheirdata.Visualizationtoolswereanaturalfit.Visualizationiscentraltoadvancedanalyticsforsimilarreasons.Whenadatascientistiswritingadvancedpredictiveanalyticsormachinelearning(ML)algorithms,itbecomesimportanttovisualizetheoutputstomonitorresultsandensurethatmodelsareperformingasintended.Thisisbecausevisualizationsofcomplexalgorithmsaregenerallyeasiertointerpretthannumericaloutputs.Datavisualizationprovidesaquickandeffectivewaytocommunicateinformationinauniversalmannerusingvisualinformation.Thepracticecanalsohelpbusinessesidentifywhichfactorsaffectcustomerbehavior;pinpointareasthatneedtobeimprovedorneedmoreattention;makedatamorememorableforstakeholders;understandwhenandwheretoplacespecificproducts;andpredictsalesvolumes.Otherbenefitsofdatavisualizationinclude:●theabilitytoabsorbinformationquickly,improveinsightsandmakefasterdecisions;●anincreasedunderstandingofthenextstepsthatmustbetakentoimprovetheorganization;●animprovedabilitytomaintaintheaudience'sinterestwithinformationtheycanunderstand;●aneasydistributionofinformationthatincreasestheopportunitytoshareinsightswitheveryoneinvolved;●eliminatingtheneedfordatascientistssincedataismoreaccessibleandunderstandable;and●anincreasedabilitytoactonfindingsquicklyand,therefore,achievesuccesswithgreaterspeedandlessmistakes.

參考試卷答案一、寫出以下單詞的中文意思(每小題0.5分,共10分)1accumulatev.堆積,積累11authenticationn.身份驗證;認證2operationn.操作;運算12malwaren.惡意軟件,流氓軟件3complexityn.復雜性13ransomwaren.勒索軟件4filteringn.過濾14vulnerabilityn.弱點;脆弱性5leakagen.漏出;泄露15processvt.加工;處理6enginen.引擎,發動機16validityn.有效性,合法性7recoveryn.恢復,復原17interpretationn.解釋,說明8storagen.貯存18classificationn.分類,歸類9ensurevt.確保19elementn.元素;要素;原理10accumulatev.堆積,積累20executableadj.可執行的;實行的二、根據給出的中文意思,寫出英文單詞(每小題0.5分,共10分)1n.元數據metadata11n.并發(性)concurrency2n.特性;屬性property12n.數據庫database3n.服務器server13adj.程序的,過程的procedural4n.推薦引擎;推薦系統recommender14n.倉庫;貯藏室repository5n.標準,規格standard15vt.收集;采集gather6adj.定性的qualitative16n.聚集;集成;集結aggregation7n.登記,注冊registration17adj.跨平臺的cross-platform8n.備份backup18vt.取得,獲得;實現achieve9n.容量;性能capacity19n.體系結構;(總體、層次)結構architecture10n.冗余;過多,過剩redundancy20v.擔保;確保n.保證;保修單guarantee1dataflow數據流2datamart數據集市3datamining數據挖掘4datasharing數據共享5datadefinition數據定義6datastorage數據存儲7datavisualization數據可視化8operatingsystem操作系統9semi-structureddata半結構化數據10sampledata樣本數據四、根據給出的中文意思,寫出英文短語(每小題1分,共10分)1非結構化數據unstructureddata2層次數據模型,分級數據模型hierarchicaldatamodel3文本分析textanalysis4數據點datapoint5數據收集datacollection6自治數據庫autonomousdatabases7數據倉庫datawarehouse8混合云hybridcloud9機器學習machinelearning10非關系數據庫nonrelationaldatabase五、寫出以下縮略語的完整形式和中文意思(每小題1分,共10分)縮略語完整形式中文意思1AIArtificialIntelligence人工智能2BDFBigDataFabric大數據結構3CMSContentManagementSystem內容管理系統4APIApplicationProgrammingInterface應用程序編程接口5DDLDataDefinitionLanguage數據定義語言6DMLDataManipulationLanguage數據操作語言7DQLDataQueryLanguage數據查詢語言8ELTExtract,Load,Transform提取、加載、轉換9JVMJavaVirtualMachineJava虛擬機10SLAServiceLevelAgreement服務等級協議,服務級別協議六、閱讀短文,回答問題(每小題2分,共10分)Thepurposeofclusteringandclassificationalgorithmsistomakesenseofandextractvaluefromlargesetsofstructuredandunstructureddata.Intheirsimplestform,clustersaresetsofdatapointsthatsharesimilarattributes,andclusteringalgorithmsarethemethodsthatgroupthesedatapointsintodifferentclustersbasedontheirsimilarities.Therearetwomaintypesofclusteringalgorithms.Theyarehierarchicalalgorithmsandpartitionalalgorithms.Inclassification,beforeyoustart,youalreadyknowthenumberofclassesintowhichyourdatashouldbegroupedandyoualreadyknowwhatclassyouwanteachdatapointtobeassigned.Tobetterillustratethenatureo

溫馨提示

  • 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
  • 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯系上傳者。文件的所有權益歸上傳用戶所有。
  • 3. 本站RAR壓縮包中若帶圖紙,網頁內容里面會有圖紙預覽,若沒有圖紙預覽就沒有圖紙。
  • 4. 未經權益所有人同意不得將文件中的內容挪作商業或盈利用途。
  • 5. 人人文庫網僅提供信息存儲空間,僅對用戶上傳內容的表現方式做保護處理,對用戶上傳分享的文檔內容本身不做任何修改或編輯,并不能對任何下載內容負責。
  • 6. 下載文件中如有侵權或不適當內容,請與我們聯系,我們立即糾正。
  • 7. 本站不保證下載資源的準確性、安全性和完整性, 同時也不承擔用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。

最新文檔

評論

0/150

提交評論