


php抓取這個(gè)網(wǎng)頁(yè)的數(shù)據(jù),只要數(shù)據(jù),不用html內(nèi)容,然后json后寫入文件,新手求教
Jun 13, 2016 pm 12:02 PM
php抓取這個(gè)網(wǎng)頁(yè)的數(shù)據(jù),只要數(shù)據(jù),不要html內(nèi)容,然后json后寫入文件,新手求教
http://www.okooo.com/Upload/sohu/table_23.html???
新收求教啊,這個(gè)難度在于正則上,不會(huì)寫正則啊
------解決方案--------------------
$url = 'http://www.okooo.com/Upload/sohu/table_23.html';<br />$s = file_get_contents($url);<br />preg_match_all('#<table.+</table>#isU', $s, $m);<br />foreach(array_map('strip_tags', $m[0]) as $r) {<br /> $a = preg_split('/\s+/', $r, -1, PREG_SPLIT_NO_EMPTY);<br /> $res[] = array_chunk(array_slice($a, 0, -1), 3);<br />}<br />print_r($res);<br />echo json_encode($res);
Array<br>(<br>????[0]?=>?Array<br>????????(<br>????????????[0]?=>?Array<br>????????????????(<br>????????????????????[0]?=>?排名<br>????????????????????[1]?=>?球隊(duì)<br>????????????????????[2]?=>?積分<br>????????????????)<br><br>????????????[1]?=>?Array<br>????????????????(<br>????????????????????[0]?=>?1<br>????????????????????[1]?=>?尤文圖斯<br>????????????????????[2]?=>?102<br>????????????????)<br><br>????????????[2]?=>?Array<br>????????????????(<br>????????????????????[0]?=>?2<br>????????????????????[1]?=>?羅馬<br>????????????????????[2]?=>?85<br>????????????????)<br><br>????????????[3]?=>?Array<br>????????????????(<br>????????????????????[0]?=>?3<br>????????????????????[1]?=>?那不勒斯<br>????????????????????[2]?=>?78<br>????????????????)<br><br>????????????[4]?=>?Array<br>????????????????(<br>????????????????????[0]?=>?4<br>????????????????????[1]?=>?佛羅倫薩<br>????????????????????[2]?=>?65<br>????????????????)<br><br>????????????[5]?=>?Array<br>????????????????(<br>????????????????????[0]?=>?5<br>????????????????????[1]?=>?國(guó)際米蘭<br>????????????????????[2]?=>?60<br>????????????????)<br><br>????????????[6]?=>?Array<br>????????????????(<br>????????????????????[0]?=>?6<br>????????????????????[1]?=>?帕爾馬<br>????????????????????[2]?=>?58<div class="clear"> </div>

Hot AI Tools

Undress AI Tool
Undress images for free

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

To reduce the size of HTML files, you need to clean up redundant code, compress content, and optimize structure. 1. Delete unused tags, comments and extra blanks to reduce volume; 2. Move inline CSS and JavaScript to external files and merge multiple scripts or style blocks; 3. Simplify label syntax without affecting parsing, such as omitting optional closed tags or using short attributes; 4. After cleaning, enable server-side compression technologies such as Gzip or Brotli to further reduce the transmission volume. These steps can significantly improve page loading performance without sacrificing functionality.

HTMLhasevolvedsignificantlysinceitscreationtomeetthegrowingdemandsofwebdevelopersandusers.Initiallyasimplemarkuplanguageforsharingdocuments,ithasundergonemajorupdates,includingHTML2.0,whichintroducedforms;HTML3.x,whichaddedvisualenhancementsandlayout

It is a semantic tag used in HTML5 to define the bottom of the page or content block, usually including copyright information, contact information or navigation links; it can be placed at the bottom of the page or nested in, etc. tags as the end of the block; when using it, you should pay attention to avoid repeated abuse and irrelevant content.

ThetabindexattributecontrolshowelementsreceivefocusviatheTabkey,withthreemainvalues:tabindex="0"addsanelementtothenaturaltaborder,tabindex="-1"allowsprogrammaticfocusonly,andtabindex="n"(positivenumber)setsacustomtabbing

Adeclarationisaformalstatementthatsomethingistrue,official,orrequired,usedtoclearlydefineorannounceanintent,fact,orrule.Itplaysakeyroleinprogrammingbydefiningvariablesandfunctions,inlegalcontextsbyreportingfactsunderoath,andindailylifebymakingintenti

loading="lazy" is an HTML attribute for and which enables the browser's native lazy loading function to improve page performance. 1. It delays loading non-first-screen resources, reduces initial loading time, saves bandwidth and server requests; 2. It is suitable for large amounts of pictures or embedded content in long pages; 3. It is not suitable for first-screen images, small icons, or lazy loading using JavaScript; 4. It is necessary to cooperate with optimization measures such as setting sizes and compressing files to avoid layout offsets and ensure compatibility. When using it, you should test the scrolling experience and weigh the user experience.

The key to using elements to represent navigation link areas is semantics and clear structure, usually in conjunction with organizational links. 1. The basic structure is to put the parallel links in and wrap them inside, which is friendly to auxiliary tools and is conducive to style control and SEO; 2. Commonly used in or, for placing main navigation or footer link collections; 3. A page can contain multiple areas, such as main menu, sidebar or footer independent navigation.

When writing legal and neat HTML, you need to pay attention to clear structure, correct semantics and standardized format. 1. Use the correct document type declaration to ensure that the browser parses according to the HTML5 standard; 2. Keep the tag closed and reasonably nested to avoid forgetting closed or wrong nesting elements; 3. Use semantic tags such as, etc. to improve accessibility and SEO; 4. The attribute value is always wrapped in quotes, and single or double quotes are used uniformly. Boolean attributes only need to exist, and the class name should be meaningful and avoid redundant attributes.
